Fine-tuned, domain-specific language models. Fast inference. Open weights. Bottensor builds specialized AI — one model per problem, shipped end-to-end.
Seven models. One intelligence stack.
Purpose-built models, each fine-tuned for a single domain using QLoRA + Unsloth. Composable on their own — together they form a complete AI stack.
A 32B finance specialist fine-tuned on millions of curated market examples. Quantized for production inference and serving live today via OpenAI-compatible API.
A small, distilled model for latency-critical inference. Optimized for edge deployment, agent loops, and high-throughput serving where every millisecond matters.
Production-grade code generation across the languages we use most — Python, TypeScript, Solidity, Java. Trained on private high-quality repositories.
Deep reasoning for complex multi-step analysis. Logic trees, step-by-step decomposition, and verifiable chains for problems that need a model to actually think.
Built for autonomous agent workflows. Tool calling, multi-step planning, and execution pipelines for real-world automation that has to run unattended.
Extended context window for document understanding, codebase reasoning, and retrieval-free workflows. For when 128K isn’t enough.
Vision-language-action model for robotic control. Continuous learning policies that train in simulation and transfer to physical hardware at the edge.
NPC Fin
A 32B parameter finance specialist — fine-tuned on proprietary market data, quantized for production inference, and serving live today through an OpenAI-compatible API.
Access NPC Fin — live now
OpenAI-compatible API for direct access to NPC Fin. First 100 API keys are free with 1M tokens each.
Bottensor — a Falcon Hash company.
We build small, fast, specialized AI models for problems generalists can't solve well. The NPC Model Family is our long-term project: one fine-tuned model per real-world domain, shipped with open weights and an OpenAI-compatible API.
A 32B model trained on the right data beats a 400B generalist that has seen everything once. Every NPC model targets a single domain.
We release model weights and adapters on HuggingFace. Roughly 25% of our work is closed (data, recipes), 75% is open (weights, code, evals).
Data curation, fine-tuning with QLoRA + Unsloth, quantization, and serving with vLLM. We run the whole pipeline so we can iterate fast.
7+ years software engineering. MS in Computer Science. Builds the NPC Model Family end-to-end — data, training, serving, product.
Simple, honest pricing.
Start free. Upgrade when you ship. Pricing below is preview — final tiers locked in at GA.
For prototypes, hobby projects, and exploring NPC Fin.
- ✓1M tokens / month
- ✓OpenAI-compatible API
- ✓Open weights on HuggingFace
- ✓Community support
For production apps and small teams shipping with NPC models.
- ✓10M tokens / month
- ✓Higher rate limits
- ✓Streaming + tool use
- ✓Email support
For teams that need dedicated capacity, custom fine-tunes, or on-prem.
- ✓Unlimited tokens
- ✓Dedicated inference
- ✓Custom fine-tunes
- ✓SLA + private support
Built with
End-to-end AI infrastructure — from data pipelines to production inference.
Specialized models.
Shipped end-to-end.
Get an API key in under a minute. 1M tokens free, no credit card.