
stable-diffusion-v1-5
Image
Stable Diffusion v1.5 (≈ 1 B params, CreativeML Open RAIL-M)
Text-to-image generator that fits on a single consumer GPU.
What it is. Latent-diffusion model that turns plain-English prompts into 512 × 512 images; upscale or edit after if you need bigger.
Under the hood. ~860 M-parameter U-Net + 123 M CLIP ViT-L/14 text encoder run in latent space—lighter and faster than pixel-space diffusion.
Runs on cheap hardware. Needs about 5-8 GB VRAM for FP16 inference; 4-bit or CPU builds will run on 4 GB cards or laptops, just slower..
Drop-in tooling. One-liner with diffusers, plus full support in Automatic1111, ComfyUI, SD.Next, InvokeAI, etc.—same workflow as any SD-1.x model.
License quirks. CreativeML Open RAIL-M lets you use it commercially but bans porn, extreme gore, hateful or copyright-breaking content.
Why pick it for Norman AI?
We can auto-generate hero art, social banners, or quick mock-ups on a single A10G or even a beefy MacBook. No vendor lock-in, no extra infra—just load the weights, point our brand-prompt template at it, and ship visuals fast.
