Understanding Model Properties
Reasoning ability:- Models with reasoning think through problems step-by-step
- Better for complex rewrites, maintaining consistency, and story logic
- Slower but more thorough
- How much text the model can process at once
- Larger windows = can handle more of your script at once
- Important for long scenes or multi-scene analysis
- How quickly the model responds
- ★★★★ = Fast (seconds)
- ★★ = Slower (10-30 seconds)
- Each request deducts credits from your account
- Lower cost = more requests per month
- Higher cost often = better quality
Available Models
OpenAI Models
| Model | Cost | Speed | Reasoning | Context | Best For |
|---|---|---|---|---|---|
| GPT-4.1 mini | 0.25 | ★★★★ | No | 1M tokens | Quick edits, dialogue polish, everyday tasks. Fastest and cheapest. |
| GPT-4o Mini | 0.5 | ★★★ | No | 200K tokens | Budget-friendly for routine revisions and brainstorming. |
| GPT-4o | 1 | ★★★ | No | 128K tokens | Balanced quality and speed for general rewriting. |
| o4 mini | 1 | ★★★ | Yes (medium) | 200K tokens | Structured reasoning for scene restructuring and plot logic. |
| o3-mini | 1 | ★★★ | Yes (high) | 200K tokens | Strong reasoning for complex multi-scene consistency. |
| GPT-4.1 | 1 | ★★★ | No | 1M tokens | Very large context for analyzing full acts or long sequences. |
Google Models
| Model | Cost | Speed | Reasoning | Context | Best For |
|---|---|---|---|---|---|
| Gemini 2.0 flash | 0.5 | ★★ | No | 1M tokens | Budget option with huge context for long-form analysis. |
| Gemini 2.5 flash | 1 | ★★ | Yes (high) | 1M tokens | High reasoning with massive context for complex story work. |
Anthropic Models (Creative & Nuanced)
| Model | Cost | Speed | Reasoning | Context | Best For |
|---|---|---|---|---|---|
| Haiku 4.5 | 0.25 | ★★★ | No | 200K tokens | Fast, creative dialogue and character voice. Often more natural prose. |
| Sonnet 4.5 ⭐ | 1 | ★★ | Yes (high) | 200K tokens | Most creative and nuanced. Excellent for character depth and subtext. Pro plan only. |
Choosing the Right Model
For Quick Tasks (Speed Priority)
Best: GPT-4.1 mini, Haiku 4.5- Fast responses
- Low cost (0.25 credits)
- Perfect for: Dialogue tweaks, quick brainstorming, line edits
For Budget Writing (Cost Priority)
Best: GPT-4.1 mini (0.25), Haiku 4.5 (0.25), GPT-4o Mini (0.5), Gemini 2.0 flash (0.5)- Maximize your free 100 credits
- ~200-400 requests per month
- Perfect for: Daily writing assistance, exploratory brainstorming
For Creative Writing (Quality Priority)
Best: Sonnet 4.5 ⭐ (Pro only), Haiku 4.5, GPT-4.1- More natural, nuanced prose
- Better character voice
- Perfect for: Dialogue refinement, character development, subtext
For Complex Story Problems (Reasoning Priority)
Best: Sonnet 4.5 ⭐, o3-mini, Gemini 2.5 flash, o4 mini- Step-by-step reasoning
- Better consistency across scenes
- Perfect for: Plot logic, multi-scene rewrites, structural issues
For Long Scripts (Context Priority)
Best: GPT-4.1 (1M tokens), Gemini models (1M tokens)- Can process entire acts at once
- Better for continuity checks
- Perfect for: Analyzing full sequences, maintaining consistency
Switch Models
1
Open Quill AI
Press
Cmd/Ctrl+L to open the Quill panel2
Click model selector
Click the model name at the top of the panel (shows current model)
3
Choose new model
Browse by provider (OpenAI, Google, Anthropic) and select a model
4
Confirm
The new model name appears in the selector. Your next message uses this model.
You can switch models mid-conversation. The change applies from your next message onward without affecting earlier responses.
Model Recommendations by Task
Dialogue refinement: Haiku 4.5, Sonnet 4.5, GPT-4.1 mini Plot brainstorming: o3-mini, Sonnet 4.5, GPT-4o Scene structure: o4 mini, Gemini 2.5 flash, o3-mini Character development: Sonnet 4.5 ⭐, Haiku 4.5, GPT-4o Quick edits: GPT-4.1 mini, Haiku 4.5 Full script analysis: GPT-4.1, Gemini 2.5 flash (large context) Budget-friendly daily use: GPT-4.1 mini, Haiku 4.5, GPT-4o MiniFrequently Asked Questions
Which model is best overall?
Which model is best overall?
There’s no single “best” model—it depends on your task. Sonnet 4.5 (Pro only) is the most creative and nuanced, but costs 1 credit per request. For everyday use, GPT-4.1 mini or Haiku 4.5 (0.25 credits) offer the best value.
What makes Anthropic models more creative?
What makes Anthropic models more creative?
Anthropic’s Claude models (Haiku and Sonnet) tend to produce more natural, literary prose with better character voice and subtext. They’re particularly good at dialogue and maintaining consistent character personalities.
Should I use reasoning models for all tasks?
Should I use reasoning models for all tasks?
No. Reasoning models (o4 mini, o3-mini, Sonnet 4.5, Gemini 2.5 flash) are slower and better for complex problems. For simple edits or brainstorming, regular models are faster and just as effective.
Can I access Sonnet 4.5 on the free plan?
Can I access Sonnet 4.5 on the free plan?
No. Sonnet 4.5 requires a Pro plan subscription. All other models are available on both free and Pro plans.
Why are some models slower?
Why are some models slower?
Models with reasoning capabilities or larger context windows take longer to process. Speed ratings (★) help you choose based on how quickly you need a response.
Does a more expensive model always give better results?
Does a more expensive model always give better results?
Not always. Cost reflects computational requirements, not always output quality for your specific task. A 0.25 credit model might be perfect for quick dialogue tweaks, while a 1 credit model shines in complex story logic.