1-Bit Bonsai Image 4B Image Generation for Local Devices

79 points - today at 3:04 PM

Source

Comments

potatoman22 today at 4:58 PM
I wonder why they didn't use a Bonsai model as the text encoder
sorenjan today at 3:38 PM
They call it a diffusion model, but it's based on Flux.2 which is a rectified flow model.
a1o today at 4:38 PM
Anyone could pickup the minimal hardware requirements for this? Like both RAM and Storage?
lumost today at 4:27 PM
I actually canโ€™t wait for the future where I upgrade hardware in order to upgrade my ai as an alternative to an expensive subscription.

There are many problems I want to work on which require billions of tokens. These are completely inaccessible without corporate project sponsorship at the moment. An asic generation machine which can pump out a few 10s of thousands of tokens per second at opus4.6 quality is more than sufficient.

janniks today at 4:56 PM
I was expecting to see images of Bonsai trees when I clicked this
wiradikusuma today at 4:40 PM
Is there a benchmark of local image generation models? Local = can run on a 16 GB MacBook or 8 GB+ NVIDIA card.
MitPitt today at 3:38 PM
Lately I've noticed posts with barely 10 points getting to HN frontpage. Was it always like this?
SilentM68 today at 4:38 PM
Question,

Is it compatible with Ollama, ComfyUI or are those providers unneeded, compatible with low-end hardware?

Also, where does "./setup.sh/ drop the components in Linux?

Thank you, Sol

yieldcrv today at 3:52 PM
impressive, combines a couple techniques that I always wanted the frontier models to have

having trouble loading the webgl browser demo on my phone but no biggy