Skip to main content

Which Chat AI models does Magai have access to?

Magai's current list of available LLM (Large Language Models)

Paul Gaurano avatar
Written by Paul Gaurano
Updated over a week ago

Magai offers 30 different AI models across various categories within a single, unified interface. Users can seamlessly switch between these models to optimize their workflows and leverage the unique strengths of each model for different tasks. Here's a comprehensive overview of our current active models.

Model Overview Table

Model

Context Window

Multiplier

Auto

128K

1x

Claude 3.5 Haiku

200K

0.5x

Claude 3.5 Sonnet

200K

1x

Claude 3.7 Sonnet

200K

3x

DeepSeek R1

64K

0.5x

DeepSeek V3

64K

0.1x

Gemini 2.5 Pro

1M

0.8x

Gemini 2.5 Flash

1M

0.1x

GPT-4.1

1M

0.7x

GPT-4.1 Mini

1M

0.2x

GPT-4.1 Nano

1M

0.1x

GPT-4o

128K

1x

Grok 3

131K

2x

Grok 3 Mini

131K

0.1x

Grok 4

131K

2x

Llama 4 Maverick

1M

0.1x

Llama 4 Scout

328K

0.1x

Mistral

128K

1x

Mistral Pixtral

128K

1x

Nemotron 70B

131K

0.03x

Nova Micro

128K

0.01x

Nova Pro

300K

0.3x

o1

200K

3x

o1 Mini

128K

1x

o3 Mini

200K

0.5x

Perplexity Deep Research

200K

2x

Perplexity Sonar

127K

0.3x

Perplexity Sonar Pro

200K

2x

Voice GPT

-

1x

Detailed Model Descriptions

1. Auto

Context Window: 128K Multiplier: 1x Overview: Auto intelligently selects the most appropriate model for your task, optimizing for both performance and word balance efficiency.

2. Claude 3.5 Haiku

Context Window: 200K Multiplier: 0.5x Overview: Claude 3.5 Haiku delivers fast, efficient responses while maintaining high quality. It's optimized for quick interactions and routine tasks with excellent cost efficiency.

Key Features:

  • Enhanced Context Management

  • Efficient Language Generation

  • Excellent Balance of Speed and Quality

  • Cost-Effective Processing

Use Cases:

  • Routine customer support

  • Quick content drafting

  • Efficient information retrieval

  • Day-to-day assistance

3. Claude 3.5 Sonnet

Context Window: 200K Multiplier: 1x Overview: Claude 3.5 Sonnet is an advanced iteration of the Claude series, developed by Anthropic. It balances sophisticated language capabilities with reasonable processing costs.

Key Features:

  • Enhanced Context Management

  • Refined Language Generation

  • Safety and Alignment

  • Customization Options

Use Cases:

  • Professional content creation and editing

  • Sophisticated customer support

  • Educational tools and tutoring systems

  • Business research and analysis

4. Claude 3.7 Sonnet

Context Window: 200K Multiplier: 3x Overview: The latest and most advanced version in the Claude Sonnet line, offering exceptional reasoning, accuracy, and deeper contextual understanding.

Key Features:

  • Superior reasoning capabilities

  • Enhanced nuance in responses

  • Advanced instruction following

  • Improved handling of complex requests

Use Cases:

  • Complex problem-solving

  • Advanced content creation

  • Professional research assistance

  • High-stakes decision support

5. DeepSeek R1

Context Window: 64K Multiplier: 0.5x Overview: DeepSeek R1 specializes in research-oriented tasks, with excellent reasoning capabilities at a moderate cost.

6. DeepSeek V3

Context Window: 64K Multiplier: 0.1x Overview: A cost-effective option with solid performance for everyday tasks and lightweight applications.

7. Gemini 2.5 Flash

Context Window: 1M
Multiplier: 0.1x
Overview: Gemini 2.5 Flash is Google's fastest and most cost-efficient large language model to date. Designed for real-time performance, it delivers lightning-fast responses across an expansive 1 million token context window—making it perfect for applications that demand both speed and scale without compromising coherence.

Key Features:

  • 1M token context window for handling complex, long documents

  • Exceptional speed and responsiveness

  • Low compute cost for high-volume usage

  • Multilingual and multimodal readiness (vision support in some deployments)

Use Cases:

  • Real-time chatbots and virtual assistants

  • High-frequency content generation and iteration

  • Long document parsing and summarization at scale

  • Fast Q&A systems, search agents, or browser copilots

8. Gemini 2.5 Pro

Context Window: 1M Multiplier: 0.8x Overview: Google's advanced multimodal model featuring an extensive context window and powerful reasoning capabilities for complex tasks.

9. GPT-4.1

Context Window: 1M
Multiplier: 0.7x
Overview: GPT-4.1 is a long-context powerhouse developed by OpenAI, offering advanced reasoning, reliable coherence, and the ability to handle extremely large documents. With a context window of 1 million tokens and an efficient usage multiplier, it’s ideal for users who need both depth and scale in their outputs.

Key Features:

  • Massive 1M token context window for seamless handling of long inputs

  • High accuracy and logical reasoning across diverse topics

  • Maintains consistency over long-form content

  • Excellent balance of quality and efficiency for premium applications

Use Cases:

  • Document analysis, contract review, and legal summaries

  • Book-length content generation or editing

  • Long conversational agents and memory-heavy chatbots

  • Data synthesis and multi-source research projects

10. GPT-4.1 Mini

Context Window: 1M
Multiplier: 0.2x
Overview: GPT-4.1 Mini is a lighter, more efficient variant of GPT-4.1. It retains much of the reasoning power and long-context support of its full-size counterpart while significantly reducing compute cost, making it a practical option for teams and individuals with high-volume needs.

Key Features:

  • Full 1M token context window

  • Strong reasoning and natural language capabilities

  • Optimized for affordability and scalability

  • Excellent performance for routine or semi-complex tasks

Use Cases:

  • Document summarization and planning

  • Scalable chatbots and workflow automation

  • Email generation and rewriting

  • Knowledge base and technical writing

11. GPT-4.1 Nano

Context Window: 1M
Multiplier: 0.1x
Overview: GPT-4.1 Nano is the most cost-efficient model in the GPT-4.1 lineup. It offers solid performance for general use cases while minimizing resource consumption. Ideal for high-frequency, low-complexity interactions or budget-conscious teams needing consistent output across long contexts.

Key Features:

  • 1M token context window with ultra-low cost

  • Lightweight design suitable for continuous use

  • Quick response times

  • Maintains core comprehension abilities

Use Cases:

  • High-volume support tickets and chatbot flows

  • Simple report generation and QA drafts

  • Lightweight assistants for data entry or sorting

  • Routine educational or content tasks

12. Grok 3

Context Window: 131K Multiplier: 2x Overview: Grok 3 is a next-generation large language model developed by xAI, designed to offer cutting-edge reasoning capabilities with a touch of personality. It excels in nuanced understanding, creative ideation, and conversational flow, making it a powerful tool for users seeking intelligent, witty, and contextually aware assistance.

Key Features:

  • Advanced reasoning and contextual comprehension

  • Quirky, opinionated tone with real-time awareness

  • Capable of handling creative, technical, and conversational prompts

  • Developed with safety and alignment frameworks

Use Cases:

  • Brainstorming and ideation for creative writing or product design

  • Engaging conversation agents and chat-based tools

  • Support for complex reasoning or logic-based problem-solving

  • Educational tools with more interactive, human-like behavior

13. Grok 3 Mini

Context Window: 131K Multiplier: 0.1x Overview: Grok 3 Mini is a lightweight, efficient variant of the Grok series by xAI. It retains the core personality and conversational strengths of its larger counterpart but is optimized for speed and affordability, making it ideal for high-volume or everyday use without sacrificing intelligence.

Key Features:

  • Fast response times with minimal resource usage

  • Retains Grok's unique tone and creative flair

  • Supports a wide range of casual and structured tasks

  • Cost-effective option for daily interactions

Use Cases:

  • Chat assistants with personality and speed

  • Lightweight customer support or chatbot applications

  • Rapid brainstorming or creative writing drafts

  • Educational tools or tutors for casual learning environments

14. Grok 4

Context Window: 256K Multiplier: 2x Overview: Grok 4 is the most advanced model in xAI’s Grok series, offering expanded context handling, deeper reasoning, and improved response quality. Designed for users who want cutting-edge intelligence paired with Grok’s signature wit, it excels at both technical depth and conversational richness.

Key Features:

  • Large 256K context window for in-depth prompts and multi-turn conversations

  • Strong logical reasoning and memory capabilities

  • Witty, engaging tone that mimics human-like interaction

  • Enhanced instruction following and nuanced comprehension

Use Cases:

  • In-depth research, summarization, or analysis across long documents

  • High-end creative work and ideation sessions

  • Advanced chatbot implementations and assistants

  • Complex Q&A systems and customer interaction tools

15. Llama 4 Maverick

Context Window: 1M Multiplier: 0.1x Overview: Meta's advanced large language model featuring an extensive 1M context window with an extremely cost-effective multiplier.

16. Llama 4 Scout

Context Window: 328K Multiplier: 0.1x Overview: A more efficient Llama 4 variant with a good balance of context length and extremely low cost.

17. Mistral

Context Window: 128K Multiplier: 1x Overview: Mistral's flagship model, designed to deliver high-performance language processing with a focus on reasoning and factuality.

Key Features:

  • High performance

  • Context-aware responses

  • Advanced reasoning capabilities

  • Cutting-edge architecture

Use Cases:

  • Enterprise content strategies

  • Technical documentation

  • Research assistance

  • Professional communication

18. Mistral Pixtral

Context Window: 128K Multiplier: 1x Overview: Mistral's multimodal model that combines text and image understanding capabilities.

19. Nemotron 70B

Context Window: 131K Multiplier: 0.03x Overview: Offering one of the lowest usage multipliers on the platform, Nemotron 70B provides excellent value while maintaining solid performance.

20. Nova Micro

Context Window: 128K Multiplier: 0.01x Overview: The most cost-effective model on the platform, ideal for high-volume, routine tasks where efficiency is paramount.

21. Nova Pro

Context Window: 300K Multiplier: 0.3x Overview: Featuring an extended context window with enhanced capabilities, Nova Pro balances advanced features and reasonable cost.

22. o3

Context Window: 200K
Multiplier: 0.7x
Overview: O3 is a mid-tier, high-efficiency language model designed to deliver strong performance with a focus on practicality. It offers a generous context window and well-rounded reasoning capabilities, making it suitable for a wide range of general-purpose tasks at a reasonable compute cost.

Key Features:

  • Balanced performance across creative and analytical tasks

  • 200K token context window supports extended conversations

  • Fast, responsive output with strong accuracy

  • Ideal for daily workflows and production-level use

Use Cases:

  • Long-context chatbots and customer service agents

  • Reliable content creation and summarization

  • Internal tools and business logic applications

  • Technical writing, planning, and report generation

23. o3 Mini

Context Window: 200K
Multiplier: 0.5x
Overview: O3 Mini is a newer-generation model designed to offer a strong balance between performance and cost. With a 200K token context window and efficient processing, it’s a reliable choice for users who need depth without the premium price tag.

Key Features:

  • Long 200K context window for handling extended inputs

  • Solid reasoning and response quality

  • Lightweight performance ideal for scaling

  • Balanced cost-to-capability ratio

Use Cases:

  • Long-form content generation and editing

  • Research, analysis, and summarization tasks

  • Scalable chatbots and assistants for business workflows

  • Documentation and knowledge base creation

24. o4 Mini

Context Window: 200K
Multiplier: 1x
Overview: O4 Mini is a streamlined, high-performance language model optimized for general-purpose tasks. Built on the latest advancements in OpenRouter infrastructure, it offers solid reasoning, fast responses, and reliable accuracy with a generous context window—making it a dependable option for both individuals and teams.

Key Features:

  • Strong performance across diverse task types

  • 200K token context window enables longer interactions

  • Balanced speed and quality for production-scale workflows

  • Consistent and reliable outputs

Use Cases:

  • Team productivity tools and smart assistants

  • Content generation, rewriting, and editing

  • Technical support bots and documentation helpers

  • Educational platforms and interactive learning tools

25. Perplexity Deep Research

Context Window: 200K
Multiplier: 2x
Overview: Advanced research-focused model with extensive knowledge retrieval and synthesis capabilities.

26. Perplexity Sonar

Context Window: 127K Multiplier: 0.3x Overview: Specialized for information retrieval and synthesis at a cost-effective multiplier rate.

27. Perplexity Sonar Pro

Context Window: 200K Multiplier: 2x Overview: Premium version of Perplexity Sonar with enhanced capabilities and expanded context window.

28. Voice GPT

Context Window: - Multiplier: 1x Overview: Specialized for voice interactions and speech processing applications.

Choosing the Right Model

When selecting a model on Magai, consider:

Context Window Requirements

For processing long documents or maintaining extended conversations, choose models with larger context windows:

  • Extensive Context: Gemini 2.5 Pro (1M), Llama 4 Maverick (1M), Gemini 2.0 Flash (1M)

  • Medium Context: Nova Pro (300K), Llama 4 Scout (328K), Claude models (200K)

  • Standard Context: Most other models (128K-131K)

Cost Efficiency

Models with lower multipliers will use your word balance more efficiently:

  • Most Efficient: Nova Micro (0.01x), Nemotron 70B (0.03x), Llama 4 Scout (0.03x)

  • Very Efficient: Llama 4 Maverick (0.05x), DeepSeek V3 (0.1x), Gemini 2.0 models (0.1x)

  • Moderately Efficient: Nova Pro (0.3x), Perplexity Sonar (0.3x), Claude 3.5 Haiku (0.5x)

Task Complexity

For complex reasoning or critical applications:

  • Premium Performance: GPT-4.5 (20x), Claude 3.7 Sonnet (3x), o1 (3x)

  • Balanced Performance: GPT-4o (1x), Claude 3.5 Sonnet (1x), Mistral (1x)

  • Research-Oriented: Perplexity Deep Research (2x), Perplexity Sonar Pro (2x)

Specialized Needs

Consider models with specific strengths matching your use case:

  • Visual Processing: Grok 2 Vision, Mistral Pixtral

  • Voice Interaction: Voice GPT

  • Information Retrieval: Perplexity models

  • Long Document Processing: Gemini models, Llama 4 Maverick

For assistance selecting the optimal model for your specific needs, please use the Auto option which intelligently selects the most appropriate model for each task based on its requirements.For the most up-to-date information on all available AI models and their specific functionalities, please visit Magai's help center.

Did this answer your question?