TextMate AI

Models

Every chat lets you pick from 8 models served via NVIDIA NIM. Three are free; the other five unlock on Pro.

Free models

  • GPT-OSS 120B (default) — balanced & fast, ~131K context window, up to 120,000 output tokens
  • GPT-OSS 20B — lightweight GPT-OSS, up to 120,000 output tokens
  • Mistral Small 4 — fastest response

Pro models

  • Mistral Large 3 — largest, highest quality
  • Llama 4 Maverick — fast & accurate, up to 16,384 output tokens
  • GLM 5.1 — strong reasoning
  • Qwen 3.5 122B — balanced Qwen
  • Kimi K2.6 — long-context reasoning
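
The tiers and output limits listed above could be modeled roughly as follows. This is a minimal sketch, not TextMate AI's actual implementation: the `Model` class, the `"free"`/`"pro"` plan strings, and both helper functions are hypothetical names chosen for illustration. Only the model names, tiers, default, and output-token limits come from this page; where the page states no limit, the sketch stores `None` and leaves the request unchanged, which should not be read as "unlimited".

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Model:
    name: str
    tier: str  # "free" or "pro" (hypothetical plan labels)
    max_output_tokens: Optional[int] = None  # None: no limit stated on this page


# Entries mirror the lists above; nothing beyond the page is filled in.
CATALOG: List[Model] = [
    Model("GPT-OSS 120B", "free", 120_000),
    Model("GPT-OSS 20B", "free", 120_000),
    Model("Mistral Small 4", "free"),
    Model("Mistral Large 3", "pro"),
    Model("Llama 4 Maverick", "pro", 16_384),
    Model("GLM 5.1", "pro"),
    Model("Qwen 3.5 122B", "pro"),
    Model("Kimi K2.6", "pro"),
]

DEFAULT_MODEL = "GPT-OSS 120B"


def available_models(plan: str) -> List[Model]:
    """Return the models a plan can select: all on Pro, free-tier only otherwise."""
    if plan == "pro":
        return list(CATALOG)
    return [m for m in CATALOG if m.tier == "free"]


def clamp_output_tokens(model_name: str, requested: int) -> int:
    """Cap a requested output length at the model's stated limit, if one is listed."""
    model = next(m for m in CATALOG if m.name == model_name)
    if model.max_output_tokens is None:
        return requested  # no limit stated here; pass the request through unchanged
    return min(requested, model.max_output_tokens)
```

For example, `available_models("free")` yields the three free models, and `clamp_output_tokens("Llama 4 Maverick", 50_000)` caps the request at that model's listed 16,384 tokens.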