Models

Browse models and install them with the pull command. You can also register any Hugging Face model into your Lemonade Server with the advanced pull command options.

Showing All labels All recipes 148 models

Hot models

Recommended models. Full supported catalog is listed below.

Qwen3-Coder-30B-A3B-Instruct-GGUF

codingtool-callinghot
lemonade-server pull Qwen3-Coder-30B-A3B-Instruct-GGUF

Qwen3-Coder-Next-GGUF

codingtool-callinghot
lemonade-server pull Qwen3-Coder-Next-GGUF

Gemma-4-26B-A4B-it-GGUF

hottool-callingvisionllamacpp
lemonade-server pull Gemma-4-26B-A4B-it-GGUF

Gemma-4-31B-it-GGUF

hottool-callingvisionllamacpp
lemonade-server pull Gemma-4-31B-it-GGUF

Qwen3.5-4B-GGUF

visiontool-callinghot
lemonade-server pull Qwen3.5-4B-GGUF

Qwen3.5-35B-A3B-GGUF

visiontool-callinghot
lemonade-server pull Qwen3.5-35B-A3B-GGUF

Qwen3-0.6B-GGUF

reasoning
lemonade-server pull Qwen3-0.6B-GGUF
Checkpointunsloth/Qwen3-0.6B-GGUF
GGUF VariantQ4_0
Recipellamacpp
Size (GB)0.38

Qwen3-1.7B-GGUF

reasoning
lemonade-server pull Qwen3-1.7B-GGUF
Checkpointunsloth/Qwen3-1.7B-GGUF
GGUF VariantQ4_0
Recipellamacpp
Size (GB)1.06

Qwen3-4B-GGUF

reasoning
lemonade-server pull Qwen3-4B-GGUF
Checkpointunsloth/Qwen3-4B-GGUF
GGUF VariantQ4_0
Recipellamacpp
Size (GB)2.38

Qwen3-8B-GGUF

reasoning
lemonade-server pull Qwen3-8B-GGUF
Checkpointunsloth/Qwen3-8B-GGUF
GGUF VariantQ4_1
Recipellamacpp
Size (GB)5.25

Qwen3-14B-GGUF

reasoning
lemonade-server pull Qwen3-14B-GGUF
Checkpointunsloth/Qwen3-14B-GGUF
GGUF VariantQ4_0
Recipellamacpp
Size (GB)8.54

Qwen3-4B-Instruct-2507-GGUF

tool-calling
lemonade-server pull Qwen3-4B-Instruct-2507-GGUF
Checkpointunsloth/Qwen3-4B-Instruct-2507-GGUF
GGUF VariantQwen3-4B-Instruct-2507-Q4_K_M.gguf
Recipellamacpp
Size (GB)2.5

Qwen3-30B-A3B-GGUF

reasoning
lemonade-server pull Qwen3-30B-A3B-GGUF
Checkpointunsloth/Qwen3-30B-A3B-GGUF
GGUF VariantQ4_0
Recipellamacpp
Size (GB)17.4

Qwen3-30B-A3B-Instruct-2507-GGUF

tool-calling
lemonade-server pull Qwen3-30B-A3B-Instruct-2507-GGUF
Checkpointunsloth/Qwen3-30B-A3B-Instruct-2507-GGUF
GGUF VariantQwen3-30B-A3B-Instruct-2507-Q4_0.gguf
Recipellamacpp
Size (GB)17.4

Qwen3-Coder-30B-A3B-Instruct-GGUF

codingtool-callinghot
lemonade-server pull Qwen3-Coder-30B-A3B-Instruct-GGUF
Checkpointunsloth/Qwen3-Coder-30B-A3B-Instruct-GGUF
GGUF VariantQwen3-Coder-30B-A3B-Instruct-Q4_K_M.gguf
Recipellamacpp
Size (GB)18.6

Qwen3-Coder-Next-GGUF

codingtool-callinghot
lemonade-server pull Qwen3-Coder-Next-GGUF
Checkpointunsloth/Qwen3-Coder-Next-GGUF
GGUF VariantQwen3-Coder-Next-MXFP4_MOE.gguf
Recipellamacpp
Size (GB)43.7

Nemotron-3-Nano-30B-A3B-GGUF

lemonade-server pull Nemotron-3-Nano-30B-A3B-GGUF
Checkpointunsloth/Nemotron-3-Nano-30B-A3B-GGUF
GGUF VariantNemotron-3-Nano-30B-A3B-UD-Q4_K_XL.gguf
Recipellamacpp
Size (GB)22.8

Gemma-3-4b-it-GGUF

vision
lemonade-server pull Gemma-3-4b-it-GGUF
Checkpointggml-org/gemma-3-4b-it-GGUF
GGUF VariantQ4_K_M
Mmprojmmproj-model-f16.gguf
Recipellamacpp
Size (GB)3.61

Gemma-4-26B-A4B-it-GGUF

hottool-callingvisionllamacpp
lemonade-server pull Gemma-4-26B-A4B-it-GGUF
Checkpointunsloth/gemma-4-26B-A4B-it-GGUF
GGUF VariantUD-Q4_K_M
Mmprojmmproj-F16.gguf
Recipellamacpp
Size (GB)16.9

Gemma-4-31B-it-GGUF

hottool-callingvisionllamacpp
lemonade-server pull Gemma-4-31B-it-GGUF
Checkpointunsloth/gemma-4-31B-it-GGUF
GGUF VariantQ4_K_M
Mmprojmmproj-F16.gguf
Recipellamacpp
Size (GB)18.3

Gemma-4-E4B-it-GGUF

tool-callingvisionllamacpp
lemonade-server pull Gemma-4-E4B-it-GGUF
Checkpointunsloth/gemma-4-E4B-it-GGUF
GGUF VariantQ4_K_M
Mmprojmmproj-F16.gguf
Recipellamacpp
Size (GB)5

Gemma-4-E2B-it-GGUF

tool-callingvisionllamacpp
lemonade-server pull Gemma-4-E2B-it-GGUF
Checkpointunsloth/gemma-4-E2B-it-GGUF
GGUF VariantQ4_K_M
Mmprojmmproj-F16.gguf
Recipellamacpp
Size (GB)3.1

Phi-4-mini-instruct-GGUF

lemonade-server pull Phi-4-mini-instruct-GGUF
Checkpointunsloth/Phi-4-mini-instruct-GGUF
GGUF VariantPhi-4-mini-instruct-Q4_K_M.gguf
Recipellamacpp
Size (GB)2.49

LFM2-1.2B-GGUF

lemonade-server pull LFM2-1.2B-GGUF
CheckpointLiquidAI/LFM2-1.2B-GGUF
GGUF VariantLFM2-1.2B-Q4_K_M.gguf
Recipellamacpp
Size (GB)0.731

LFM2.5-1.2B-Instruct-GGUF

lemonade-server pull LFM2.5-1.2B-Instruct-GGUF
CheckpointLiquidAI/LFM2.5-1.2B-Instruct-GGUF
GGUF VariantLFM2.5-1.2B-Instruct-Q4_K_M.gguf
Recipellamacpp
Size (GB)0.731

Jan-nano-128k-GGUF

lemonade-server pull Jan-nano-128k-GGUF
CheckpointMenlo/Jan-nano-128k-gguf
GGUF Variantjan-nano-128k-Q4_K_M.gguf
Recipellamacpp
Size (GB)2.5

Jan-v1-4B-GGUF

lemonade-server pull Jan-v1-4B-GGUF
Checkpointjanhq/Jan-v1-4B-GGUF
GGUF VariantJan-v1-4B-Q4_K_M.gguf
Recipellamacpp
Size (GB)2.5

Llama-3.2-1B-Instruct-GGUF

lemonade-server pull Llama-3.2-1B-Instruct-GGUF
Checkpointunsloth/Llama-3.2-1B-Instruct-GGUF
GGUF VariantLlama-3.2-1B-Instruct-UD-Q4_K_XL.gguf
Recipellamacpp
Size (GB)0.834

Llama-3.2-3B-Instruct-GGUF

lemonade-server pull Llama-3.2-3B-Instruct-GGUF
Checkpointunsloth/Llama-3.2-3B-Instruct-GGUF
GGUF VariantLlama-3.2-3B-Instruct-UD-Q4_K_XL.gguf
Recipellamacpp
Size (GB)2.06

SmolLM3-3B-GGUF

lemonade-server pull SmolLM3-3B-GGUF
Checkpointunsloth/SmolLM3-3B-128K-GGUF
GGUF VariantSmolLM3-3B-128K-UD-Q4_K_XL.gguf
Recipellamacpp
Size (GB)1.94

Ministral-3-3B-Instruct-2512-GGUF

vision
lemonade-server pull Ministral-3-3B-Instruct-2512-GGUF
Checkpointmistralai/Ministral-3-3B-Instruct-2512-GGUF
GGUF VariantMinistral-3-3B-Instruct-2512-Q4_K_M.gguf
MmprojMinistral-3-3B-Instruct-2512-BF16-mmproj.gguf
Recipellamacpp
Size (GB)2.85

Qwen2.5-VL-7B-Instruct-GGUF

vision
lemonade-server pull Qwen2.5-VL-7B-Instruct-GGUF
Checkpointggml-org/Qwen2.5-VL-7B-Instruct-GGUF
GGUF VariantQ4_K_M
Mmprojmmproj-Qwen2.5-VL-7B-Instruct-f16.gguf
Recipellamacpp
Size (GB)4.68

Qwen2.5-VL-3B-Instruct-GGUF

vision
lemonade-server pull Qwen2.5-VL-3B-Instruct-GGUF
Checkpointggml-org/Qwen2.5-VL-3B-Instruct-GGUF
GGUF VariantQ4_K_M
Mmprojmmproj-Qwen2.5-VL-3B-Instruct-f16.gguf
Recipellamacpp
Size (GB)3.27

Qwen3-VL-4B-Instruct-GGUF

vision
lemonade-server pull Qwen3-VL-4B-Instruct-GGUF
CheckpointQwen/Qwen3-VL-4B-Instruct-GGUF
GGUF VariantQ4_K_M
Mmprojmmproj-Qwen3VL-4B-Instruct-F16.gguf
Recipellamacpp
Size (GB)3.33

Qwen3-VL-8B-Instruct-GGUF

vision
lemonade-server pull Qwen3-VL-8B-Instruct-GGUF
CheckpointQwen/Qwen3-VL-8B-Instruct-GGUF
GGUF VariantQ4_K_M
Mmprojmmproj-Qwen3VL-8B-Instruct-F16.gguf
Recipellamacpp
Size (GB)6.19

Qwen3-Next-80B-A3B-Instruct-GGUF

tool-calling
lemonade-server pull Qwen3-Next-80B-A3B-Instruct-GGUF
Checkpointunsloth/Qwen3-Next-80B-A3B-Instruct-GGUF
GGUF VariantQwen3-Next-80B-A3B-Instruct-UD-Q4_K_XL.gguf
Recipellamacpp
Size (GB)45.1

Qwen3.5-0.8B-GGUF

visiontool-calling
lemonade-server pull Qwen3.5-0.8B-GGUF
Checkpointunsloth/Qwen3.5-0.8B-GGUF
GGUF VariantQwen3.5-0.8B-UD-Q4_K_XL.gguf
Mmprojmmproj-F16.gguf
Recipellamacpp
Size (GB)0.56

Qwen3.5-2B-GGUF

visiontool-calling
lemonade-server pull Qwen3.5-2B-GGUF
Checkpointunsloth/Qwen3.5-2B-GGUF
GGUF VariantQwen3.5-2B-UD-Q4_K_XL.gguf
Mmprojmmproj-F16.gguf
Recipellamacpp
Size (GB)1.34

Qwen3.5-4B-GGUF

visiontool-callinghot
lemonade-server pull Qwen3.5-4B-GGUF
Checkpointunsloth/Qwen3.5-4B-GGUF
GGUF VariantQwen3.5-4B-UD-Q4_K_XL.gguf
Mmprojmmproj-F16.gguf
Recipellamacpp
Size (GB)2.91

Qwen3.5-9B-GGUF

visiontool-calling
lemonade-server pull Qwen3.5-9B-GGUF
Checkpointunsloth/Qwen3.5-9B-GGUF
GGUF VariantQwen3.5-9B-UD-Q4_K_XL.gguf
Mmprojmmproj-F16.gguf
Recipellamacpp
Size (GB)5.97

Qwen3.5-35B-A3B-GGUF

visiontool-callinghot
lemonade-server pull Qwen3.5-35B-A3B-GGUF
Checkpointunsloth/Qwen3.5-35B-A3B-GGUF
GGUF VariantQwen3.5-35B-A3B-UD-Q4_K_XL.gguf
Mmprojmmproj-F16.gguf
Recipellamacpp
Size (GB)19.7

Qwen3.5-122B-A10B-GGUF

visiontool-callinghot
lemonade-server pull Qwen3.5-122B-A10B-GGUF
Checkpointunsloth/Qwen3.5-122B-A10B-GGUF
GGUF VariantUD-Q4_K_XL
Mmprojmmproj-F16.gguf
Recipellamacpp
Size (GB)68.4

Qwen3.5-27B-GGUF

visiontool-calling
lemonade-server pull Qwen3.5-27B-GGUF
Checkpointunsloth/Qwen3.5-27B-GGUF
GGUF VariantQwen3.5-27B-UD-Q4_K_XL.gguf
Mmprojmmproj-F16.gguf
Recipellamacpp
Size (GB)16.7

nomic-embed-text-v1-GGUF

embeddings
lemonade-server pull nomic-embed-text-v1-GGUF
Checkpointnomic-ai/nomic-embed-text-v1-GGUF
GGUF VariantQ4_K_S
Recipellamacpp
Size (GB)0.0781

Qwen3-Embedding-0.6B-GGUF

embeddings
lemonade-server pull Qwen3-Embedding-0.6B-GGUF
CheckpointQwen/Qwen3-Embedding-0.6B-GGUF
GGUF VariantQwen3-Embedding-0.6B-Q8_0.gguf
Recipellamacpp
Size (GB)0.64

Qwen3-Embedding-4B-GGUF

embeddings
lemonade-server pull Qwen3-Embedding-4B-GGUF
CheckpointQwen/Qwen3-Embedding-4B-GGUF
GGUF VariantQwen3-Embedding-4B-Q8_0.gguf
Recipellamacpp
Size (GB)4.28

Qwen3-Embedding-8B-GGUF

embeddings
lemonade-server pull Qwen3-Embedding-8B-GGUF
CheckpointQwen/Qwen3-Embedding-8B-GGUF
GGUF VariantQwen3-Embedding-8B-Q8_0.gguf
Recipellamacpp
Size (GB)8.05

Devstral-Small-2507-GGUF

codingtool-calling
lemonade-server pull Devstral-Small-2507-GGUF
Checkpointmistralai/Devstral-Small-2507_gguf
GGUF VariantQ4_K_M
Recipellamacpp
Size (GB)14.3

Qwen2.5-Coder-32B-Instruct-GGUF

coding
lemonade-server pull Qwen2.5-Coder-32B-Instruct-GGUF
CheckpointQwen/Qwen2.5-Coder-32B-Instruct-GGUF
GGUF VariantQ4_K_M
Recipellamacpp
Size (GB)19.85

gpt-oss-120b-mxfp-GGUF

hotreasoningtool-calling
lemonade-server pull gpt-oss-120b-mxfp-GGUF
Checkpointggml-org/gpt-oss-120b-GGUF
GGUF Variant*
Recipellamacpp
Size (GB)63.3

gpt-oss-20b-mxfp4-GGUF

hotreasoningtool-calling
lemonade-server pull gpt-oss-20b-mxfp4-GGUF
Checkpointggml-org/gpt-oss-20b-GGUF
Recipellamacpp
Size (GB)12.1

GLM-4.5-Air-UD-Q4K-XL-GGUF

reasoning
lemonade-server pull GLM-4.5-Air-UD-Q4K-XL-GGUF
Checkpointunsloth/GLM-4.5-Air-GGUF
GGUF VariantUD-Q4_K_XL
Recipellamacpp
Size (GB)73.1

GLM-4.7-Flash-GGUF

tool-calling
lemonade-server pull GLM-4.7-Flash-GGUF
Checkpointunsloth/GLM-4.7-Flash-GGUF
GGUF VariantGLM-4.7-Flash-UD-Q4_K_XL.gguf
Recipellamacpp
Size (GB)17.6

granite-4.0-h-tiny-GGUF

tool-calling
lemonade-server pull granite-4.0-h-tiny-GGUF
Checkpointunsloth/granite-4.0-h-tiny-GGUF
GGUF VariantQ4_K_M
Recipellamacpp
Size (GB)4.25

LFM2-8B-A1B-GGUF

lemonade-server pull LFM2-8B-A1B-GGUF
CheckpointLiquidAI/LFM2-8B-A1B-GGUF
GGUF VariantQ4_K_M
Recipellamacpp
Size (GB)4.8

LFM2-24B-A2B-GGUF

lemonade-server pull LFM2-24B-A2B-GGUF
CheckpointLiquidAI/LFM2-24B-A2B-GGUF
GGUF VariantQ4_K_M
Recipellamacpp
Size (GB)14.4

Whisper-Tiny

audiotranscription
lemonade-server pull Whisper-Tiny
Checkpoints[object Object]
Recipewhispercpp
Size (GB)0.075

Whisper-Base

audiotranscription
lemonade-server pull Whisper-Base
Checkpoints[object Object]
Recipewhispercpp
Size (GB)0.142

Whisper-Small

audiotranscription
lemonade-server pull Whisper-Small
Checkpoints[object Object]
Recipewhispercpp
Size (GB)0.466

Whisper-Medium

audiotranscription
lemonade-server pull Whisper-Medium
Checkpoints[object Object]
Recipewhispercpp
Size (GB)1.42

Whisper-Large-v3

audiotranscription
lemonade-server pull Whisper-Large-v3
Checkpoints[object Object]
Recipewhispercpp
Size (GB)2.87

Whisper-Large-v3-Turbo

audiotranscriptionhot
lemonade-server pull Whisper-Large-v3-Turbo
Checkpoints[object Object]
Recipewhispercpp
Size (GB)1.55

SD-Turbo

image
lemonade-server pull SD-Turbo
Checkpointstabilityai/sd-turbo
GGUF Variantsd_turbo.safetensors
Recipesd-cpp
Size (GB)5.2
Default Steps4
Default CFG Scale1
Default Size512x512

SDXL-Turbo

image
lemonade-server pull SDXL-Turbo
Checkpointstabilityai/sdxl-turbo
GGUF Variantsd_xl_turbo_1.0_fp16.safetensors
Recipesd-cpp
Size (GB)6.9
Default Steps4
Default CFG Scale1
Default Size512x512

SDXL-Base-1.0

image
lemonade-server pull SDXL-Base-1.0
Checkpointstabilityai/stable-diffusion-xl-base-1.0
GGUF Variantsd_xl_base_1.0.safetensors
Recipesd-cpp
Size (GB)6.9
Default Steps20
Default CFG Scale7.5
Default Size1024x1024

Flux-2-Klein-4B

imageedit
lemonade-server pull Flux-2-Klein-4B
Checkpoints[object Object]
Recipesd-cpp
Size (GB)16
Default Steps4
Default CFG Scale1
Default Size1024x1024

Flux-2-Klein-9B-GGUF

imageedit
lemonade-server pull Flux-2-Klein-9B-GGUF
Checkpoints[object Object]
Recipesd-cpp
Size (GB)18
Default Steps4
Default CFG Scale1
Default Size1024x1024

Qwen-Image-GGUF

image
lemonade-server pull Qwen-Image-GGUF
Checkpoints[object Object]
Recipesd-cpp
Size (GB)10
Default Steps20
Default CFG Scale2.5
Default Size512x512
Recipe Options[object Object]

Qwen-Image-2512-GGUF

image
lemonade-server pull Qwen-Image-2512-GGUF
Checkpoints[object Object]
Recipesd-cpp
Size (GB)12
Default Steps20
Default CFG Scale2.5
Default Size512x512
Recipe Options[object Object]

Z-Image-Turbo

image
lemonade-server pull Z-Image-Turbo
Checkpoints[object Object]
Recipesd-cpp
Size (GB)20
Default Steps9
Default CFG Scale1
Default Size1024x1024

RealESRGAN-x4plus

esrganimage
lemonade-server pull RealESRGAN-x4plus
Checkpointamd/realesrgan-x4plus
GGUF VariantRealESRGAN_x4plus.pth
Recipesd-cpp
Size (GB)0.064

RealESRGAN-x4plus-anime

esrganimage
lemonade-server pull RealESRGAN-x4plus-anime
Checkpointamd/realesrgan-x4plus-anime-6b
GGUF VariantRealESRGAN_x4plus_anime_6B.pth
Recipesd-cpp
Size (GB)0.017

kokoro-v1

ttsspeech
lemonade-server pull kokoro-v1
Checkpointmikkoph/kokoro-onnx
Recipekokoro
Size (GB)0.34