Example models
Pre-built, pre-reduced, ready to download. Each model is a single executable — no setup, no internet, no GPU required.
Joke Teller
JokesOur flagship demo. A 3B parameter model surgically reduced with Q4K quantization, then LoRA-finetuned on 271 joke examples. Tells coherent jokes, handles follow-ups, and answers general questions — all in a single 2 GB executable with a built-in web UI.
Based on LLaMA 3.2 3B
EN → FR Translator
TranslationA task-specific translator that only speaks English and French. Vocabulary pruning strips tokens for every other language, and surgery removes layers the model doesn't need for translation.
Based on TBD
Code Assistant
CodeA focused coding assistant that writes, explains, and debugs code. Stripped of general chat, creative writing, and multilingual capability — just code.
Based on TBD
Need a different model or task?
These are examples. We can build custom task-specific models for your exact use case.
Make an Enquiry