AI Models

Learn about the different AI models available on Vectly and how to choose the right one for your needs.

Last Updated: 5/27/2025

AI Models on Vectly

Vectly provides access to multiple state-of-the-art AI models, each with unique strengths and optimal use cases. This guide helps you choose the right model for your needs.

Available Models

Claude Models (Anthropic)

Claude 4 Opus

Best for: Complex reasoning, detailed analysis, creative writing
Strengths:
- Exceptional reasoning capabilities
- Nuanced understanding of context
- Strong safety features
- Great for technical documentation
Context Window: 200K tokens
Credit Cost: Higher (see pricing)

Claude 4 Sonnet

Best for: Balanced performance and cost
Strengths:
- Good reasoning abilities
- Faster than Opus
- Cost-effective for most tasks
- Reliable for code generation
Context Window: 200K tokens
Credit Cost: Medium

GPT Models (OpenAI)

GPT-4.1 Turbo

Best for: General-purpose tasks, code generation, analysis
Strengths:
- Excellent code understanding
- Strong general knowledge
- Fast response times
- Good at following instructions
Context Window: 128K tokens
Credit Cost: Medium-High

GPT-4o

Best for: Multimodal tasks, vision understanding
Strengths:
- Can analyze images
- Native multimodal understanding
- Efficient processing
- Good for visual content
Context Window: 128K tokens
Credit Cost: Medium

GPT-4o Mini

Best for: Quick queries, simple tasks, testing
Strengths:
- Very fast responses
- Low credit usage
- Good for simple questions
- Ideal for development/testing
Context Window: 128K tokens
Credit Cost: Low

O1 Models (OpenAI)

O1 Preview

Best for: Complex reasoning, mathematical problems, coding challenges
Strengths:
- Advanced reasoning capabilities
- Step-by-step problem solving
- Excellent for STEM tasks
- Self-correcting reasoning
Context Window: 128K tokens
Credit Cost: Very High

O1 Mini

Best for: Reasoning tasks with budget constraints
Strengths:
- Good reasoning abilities
- More affordable than O1 Preview
- Faster processing
- Suitable for moderate complexity
Context Window: 128K tokens
Credit Cost: High

Choosing the Right Model

By Use Case

📝 Writing & Content Creation

Best: Claude 4 Opus
Alternative: GPT-4.1 Turbo
Budget: Claude 4 Sonnet

💻 Code Generation & Debugging

Best: GPT-4.1 Turbo
Alternative: Claude 4 Sonnet
Budget: GPT-4o Mini

🧮 Complex Problem Solving

Best: O1 Preview
Alternative: Claude 4 Opus
Budget: O1 Mini

🔍 Research & Analysis

Best: Claude 4 Opus
Alternative: GPT-4.1 Turbo
Budget: Claude 4 Sonnet

🖼️ Image Analysis

Best: GPT-4o
Alternative: GPT-4.1 Turbo (text only)

⚡ Quick Questions

Best: GPT-4o Mini
Alternative: Claude 4 Sonnet

Model Features

Reasoning Mode

Available for Claude and O1 models:

Visible Reasoning (Claude): See the AI's thought process
Hidden Reasoning (O1): Advanced reasoning without visible steps

Web Search

Compatible with all models:

Search current information
Verify facts and data
Find recent events
Additional credit cost applies

Prompt Caching

Reduces costs for repetitive contexts:

Available for Claude models
Automatically caches repeated prompts
Significant savings for long contexts

Credit Usage

Credit consumption varies by:

Model Selected: Premium models cost more
Input Length: Longer prompts use more credits
Output Length: Longer responses cost more
Features Used: Web search, reasoning add costs

Approximate Credit Costs (per 1K tokens)

Model	Input	Output
GPT-4o Mini	0.1	0.3
Claude 4 Sonnet	0.3	1.5
GPT-4.1 Turbo	0.5	1.5
GPT-4o	0.5	1.5
Claude 4 Opus	1.5	7.5
O1 Mini	3.0	12.0
O1 Preview	15.0	60.0

Best Practices

1. Start Small

Test with GPT-4o Mini first
Upgrade to premium models if needed
Use prompt caching for repeated contexts

2. Match Model to Task

Don't use O1 Preview for simple questions
Don't use Mini models for complex analysis
Consider context window limits

3. Optimize for Credits

Be concise in your prompts
Use system prompts effectively
Enable caching when applicable
Monitor usage in real-time

4. Leverage Strengths

Use Claude for nuanced writing
Use GPT-4 for code and general tasks
Use O1 for complex reasoning
Use Mini models for iteration

Model Comparison Table

Feature	Claude 4 Opus	Claude 4 Sonnet	GPT-4.1 Turbo	GPT-4o	GPT-4o Mini	O1 Preview	O1 Mini
Best For	Complex tasks	Balanced use	General purpose	Multimodal	Quick queries	Reasoning	Budget reasoning
Speed	Slow	Medium	Fast	Fast	Very Fast	Slow	Medium
Context	200K	200K	128K	128K	128K	128K	128K
Reasoning	Excellent	Good	Good	Good	Basic	Exceptional	Very Good
Code	Very Good	Good	Excellent	Very Good	Good	Excellent	Very Good
Cost	High	Medium	Medium	Medium	Low	Very High	High

Coming Soon

We're constantly adding new models:

Gemini Pro models
Specialized fine-tuned models
Custom model deployment
Model mixing capabilities

Stay tuned for updates!