What is the difference between “chat” and “reasoner”?
deepseek-chatis the standard mode (V3.2 Non-thinking) for fast answers.deepseek-reasoneractivates the “Thinking” mode (V3.2 Thinking), where the model pauses to generate a hidden chain of thought before answering, perfect for hard math problems.Can I run DeepSeek V3.2 locally?
You can run the smaller distilled versions (like 7B or 8B) on a consumer GPU. However, the full V3.2 671B model requires massive VRAM (multiple A100/H100 GPUs), making it practical only for data centers.
How much does it cost?
The API pricing is $0.28 per million input tokens (or $0.028 if cached!) and $0.42 per million output tokens. This is roughly 10-20x cheaper than GPT-4o.









