Models

Access 100+ AI models through a single API. Compare price, context window, and speed.

89 models

Auto (Smart Route)

Futrix API automatically picks the best model based on the complexity of each request: simple prompts go to a cheap model, hard prompts go to a strong one. Saves 40-60% on cost.

Chat, Smart Routing, Cost Optimization, Auto Fallback
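The routing idea described above can be illustrated with a small sketch. This is not Futrix's routing logic, which the page does not document: the keyword heuristic, the threshold, and the two placeholder model names are all assumptions made for illustration.

```python
# Illustrative sketch only. The heuristic, threshold, and model names are
# assumptions; the catalog does not describe how Smart Route scores requests.
CHEAP_MODEL = "cheap-model"    # hypothetical placeholder name
STRONG_MODEL = "strong-model"  # hypothetical placeholder name

# Phrases that loosely suggest a request needs multi-step reasoning.
REASONING_HINTS = ("prove", "derive", "refactor", "step by step", "analyze")


def estimate_complexity(prompt: str) -> int:
    """Return a rough score: long prompts and reasoning phrases score higher."""
    lowered = prompt.lower()
    score = len(lowered.split()) // 50  # one point per 50 words
    score += sum(2 for hint in REASONING_HINTS if hint in lowered)
    return score


def pick_model(prompt: str, threshold: int = 2) -> str:
    """Route simple prompts to the cheap model and hard ones to the strong model."""
    if estimate_complexity(prompt) >= threshold:
        return STRONG_MODEL
    return CHEAP_MODEL
```

A production router would likely use a learned classifier or token counts rather than keywords; the point here is only the shape of the decision.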

GPT-5.4

Mid-tier, New, Hot

OpenAI's latest flagship model. Excels at reasoning, code, and creative work. 256K context, with vision and tool support.

Chat, Vision, Function Calling, JSON Mode, Streaming, +1 more

Claude Sonnet 4.6

Mid-tier, New, Hot

Anthropic's latest balanced model. Excellent at code, analysis, and long-form writing. 200K-token context.

Chat, Vision, Function Calling, Streaming, Extended Thinking, +1 more

DeepSeek V3

Open-weight model on par with GPT-4o at a very low price. Cache hits are billed 10x cheaper. Good for code and reasoning.

Chat, Function Calling, JSON Mode, Streaming, FIM
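To see what a 10x cache-hit discount means for a bill, here is a minimal arithmetic sketch. The price passed in is a made-up number, not a Futrix or DeepSeek price, and the function name and parameters are hypothetical.

```python
def input_cost(total_tokens: int, cached_tokens: int,
               price_per_million: float, cache_discount: float = 10.0) -> float:
    """Estimate input cost when cached tokens are billed at 1/cache_discount.

    price_per_million is a hypothetical list price per 1M input tokens.
    """
    if not 0 <= cached_tokens <= total_tokens:
        raise ValueError("cached_tokens must be between 0 and total_tokens")
    full_rate = price_per_million / 1_000_000
    uncached_tokens = total_tokens - cached_tokens
    return uncached_tokens * full_rate + cached_tokens * full_rate / cache_discount
```

With half of the input served from cache, the input cost falls to 55% of the uncached price: 50% billed at the full rate plus 50% billed at one tenth of it.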

Gemini 2.5 Flash

Fast model from Google with thinking capability. 1M-token context, low price, with vision and grounding support.

Chat, Vision, Thinking, Grounding, Code Execution, +1 more

Gemini 2.5 Pro

Google's most capable model. 1M-token context, deep thinking, excellent for coding and complex analysis.

Chat, Vision, Thinking, Grounding, Code Execution, +1 more

DeepSeek R1

Dedicated reasoning model. Reasons step by step on its own. Excellent at math, logic, and complex code.

Chat, Reasoning, Math, Code, Streaming

Groq Llama 3.3 70B

Llama 3.3 70B running on Groq LPU, with very fast inference (>300 tok/s). Low price.

Chat, Function Calling, JSON Mode, Streaming

O4 Mini

OpenAI's latest reasoning model. Fast, cheaper than O3, good for math and logic.

Chat, Reasoning, Math, Code, Function Calling, +1 more

Claude Opus 4.6

Premium, New, Hot

Anthropic's most capable model. Excellent for complex tasks, research, and in-depth analysis.

Chat, Vision, Function Calling, Streaming, Extended Thinking, +2 more

Groq Compound

Compound agent from Groq that can search the web, run code, and access Wikipedia. Free.

Chat, Web Search, Code Execution, Streaming

Groq Compound Mini

Lightweight version of the Compound agent. Faster, suited to simple tasks. Free.

Chat, Web Search, Code Execution, Streaming

GPT-5.4 Nano

Ultra-light model in the GPT-5.4 family. Very cheap and fast, suited to classification, extraction, and simple chatbots.

Chat, Function Calling, JSON Mode, Streaming

GPT-5 Nano

Ultra-light model in the GPT-5 family. Lowest price, fast, suited to simple high-volume tasks.

Chat, Function Calling, JSON Mode, Streaming

Groq Llama 3.1 8B

Small model running on Groq LPU. Very fast, lowest price. Suited to simple tasks.

Chat, JSON Mode, Streaming

Groq GPT-OSS 20B

OpenAI's open-source GPT 20B running on Groq. Fast and cheap, with good quality for mid-complexity tasks.

Chat, Function Calling, JSON Mode, Streaming

GPT-4.1 Nano

Smallest model in the GPT-4.1 family. Very fast and cheap, suited to simple text processing, embedding, and classification.

Chat, Function Calling, JSON Mode, Streaming

Gemini 2.5 Flash Lite

Lightest version of Gemini Flash. 1M-token context, very fast, cheapest in the Gemini family.

Chat, Vision, Streaming, Grounding

Gemini 2.0 Flash

Fast model from Google with a 1M-token context. Supports vision, grounding, and code execution.

Chat, Vision, Streaming, Grounding, Code Execution

Groq Llama 4 Scout

Meta Llama 4 Scout 17B running on Groq. Fast, cheap, with vision support.

Chat, Vision, Function Calling, Streaming

Groq GPT-OSS 120B

OpenAI's open-source GPT 120B on Groq. Strong, fast, and cheaper than closed-source models.

Chat, Function Calling, JSON Mode, Streaming

GPT-4o Mini

Lightweight version of GPT-4o. Very cheap and fast, suited to chatbots, classification, and summarization.

Chat, Vision, Function Calling, JSON Mode, Streaming

GPT-5.4 Mini

Lightweight version of GPT-5.4. Good balance of quality and cost, fast, with tool support.

Chat, Vision, Function Calling, JSON Mode, Streaming

GPT-5 Mini

Lightweight version of GPT-5. Fast and cheap, suited to everyday tasks.

Chat, Vision, Function Calling, JSON Mode, Streaming

Groq Qwen3 32B

Qwen3 32B running on Groq. Supports reasoning, good at code, fast.

Chat, Reasoning, Code, Streaming

GPT-4.1 Mini

Lightweight version of GPT-4.1. Fast and cheap, with good function calling and JSON mode support.

Chat, Vision, Function Calling, JSON Mode, Streaming

Gemini 3 Flash

Flash version of third-generation Gemini. Fast, stronger than 2.5, 1M-token context.

Chat, Vision, Thinking, Grounding, Code Execution, +1 more

Gemini 3.1 Flash

Gemini 3.1 Flash, the latest upgrade to the Flash line. Fast, strong, 1M-token context.

Chat, Vision, Thinking, Grounding, Code Execution, +1 more

Groq Kimi K2

Moonshot Kimi K2 running on Groq. Strong at reasoning and code, with fast inference.

Chat, Reasoning, Code, Streaming

Groq ALLAM 2 7B

Free, New

ALLAM 2 7B on Groq, a model specialized in Arabic and English. Free and fast.

Chat, Multi-language, Streaming

Claude Haiku 4.5

Fastest and cheapest model in the Claude family. Suited to chatbots, classification, and summarization.

Chat, Vision, Function Calling, Streaming

O3 Mini

Lightweight reasoning model. Reasonably priced, good for coding and math.

Chat, Reasoning, Math, Code, Streaming

GPT-5

GPT-5, a big leap from OpenAI. Significantly stronger than GPT-4o, reasonably priced.

Chat, Vision, Function Calling, JSON Mode, Streaming, +1 more

GPT-5.1

Upgrade over GPT-5. Improved reasoning and code, priced similarly to GPT-5.

Chat, Vision, Function Calling, JSON Mode, Streaming, +1 more

GPT-5.2

GPT-5.2, with significant improvements in code and reasoning over 5.1. 256K context.

Chat, Vision, Function Calling, JSON Mode, Streaming, +1 more

GPT-4.1

Upgrade over GPT-4o. Stronger at code, better instruction following, 1M-token context.

Chat, Vision, Function Calling, JSON Mode, Streaming, +1 more

O3

Strongest reasoning model in the O series. Excellent for math, science, and complex coding.

Chat, Reasoning, Math, Code, Function Calling, +1 more

Gemini 3 Pro

Third-generation Gemini Pro. Significantly stronger than 2.5 Pro, 1M context, deep thinking.

Chat, Vision, Thinking, Grounding, Code Execution, +1 more

Gemini 3.1 Pro

Gemini 3.1 Pro, the latest from Google. Upgraded reasoning and code, 1M-token context.

Chat, Vision, Thinking, Grounding, Code Execution, +1 more

GPT-4o

OpenAI's strong general-purpose model. Supports text, vision, and audio. Fast, high quality.

Chat, Vision, Function Calling, JSON Mode, Streaming

Claude Sonnet 4.5

Claude Sonnet 4.5. Strong at code and creative work, with extended thinking support.

Chat, Vision, Function Calling, Streaming, Extended Thinking, +1 more

Claude Sonnet 4

Claude Sonnet 4. Well-balanced model, reasonably priced, excellent for code and analysis.

Chat, Vision, Function Calling, Streaming, Extended Thinking, +1 more

Claude Opus 4.5

Claude Opus 4.5. Very strong at complex reasoning, creative writing, and in-depth analysis.

Chat, Vision, Function Calling, Streaming, Extended Thinking, +1 more

Claude Opus 4.1

Claude Opus 4.1. Strong for complex tasks and agentic workflows.

Chat, Vision, Function Calling, Streaming, Extended Thinking, +1 more

Claude Opus 4

Claude Opus 4. First flagship model of the Opus 4 line, strong for agentic coding.

Chat, Vision, Function Calling, Streaming, Extended Thinking, +1 more

O1

Premium reasoning model. Deep reasoning, suited to research, math, and science.

Chat, Reasoning, Math, Science, Code

GPT-5.2 Pro

Pro version of GPT-5.2. Very strong reasoning, long outputs, built for in-depth tasks.

Chat, Vision, Reasoning, Code, Function Calling, +1 more

GPT-5.4 Pro

OpenAI's most capable model. Expert-level reasoning, built for the hardest tasks.

Chat, Vision, Reasoning, Code, Function Calling, +1 more

GPT-5 Pro

Pro version of GPT-5. Strong reasoning, long outputs, built for in-depth tasks.

Chat, Vision, Reasoning, Code, Function Calling, +1 more

GPT-5.3 Codex

The latest Codex. Specialized for code, optimized for agentic coding tasks and multi-file editing.

Code, Multi-file Edit, Agentic, Function Calling, Streaming

GPT-5.2 Codex

Codex 5.2. Specialized for code, refactoring, and debugging. Supports multi-file editing.

Code, Multi-file Edit, Agentic, Function Calling, Streaming

GPT-5.1 Codex

Codex 5.1, a dedicated code model. Good for refactoring and debugging.

Code, Multi-file Edit, Agentic, Function Calling, Streaming

GPT-5.1 Codex Max

Codex Max, the strongest code model. Very large context, handles large codebases and complex refactors.

Code, Multi-file Edit, Agentic, Function Calling, Streaming, +1 more

GPT-5 Codex

GPT-5 Codex. Specialized for code, reasonably priced.

Code, Multi-file Edit, Agentic, Function Calling, Streaming

GPT-5.1 Codex Mini

Codex Mini. Light, fast, and cheap, suited to autocomplete and simple code tasks.

Code, Autocomplete, Function Calling, Streaming

GPT-5 Search

GPT-5 with real-time web search. Answers include sources and up-to-date information.

Chat, Web Search, Citations, Streaming

GPT-4o Search

GPT-4o with web search. Looks up information in real time and answers with cited sources.

Chat, Web Search, Citations, Streaming

Imagen 4

Google Imagen 4. Generates high-quality, photorealistic images, with support for text in images.

Image Generation, Photorealistic, Text in Image

Imagen 4 Fast

Fast version of Imagen 4. Generates images more quickly, suited to prototyping and batch jobs.

Image Generation, Fast, Photorealistic

GPT Image 1.5

OpenAI's latest image generation model. Understands context well and produces high-quality images.

Image Generation, Image Editing, Text in Image

GPT Image 1

OpenAI GPT Image. Generates images from text, edits images, and supports text in images.

Image Generation, Image Editing, Text in Image

GPT Image 1 Mini

Lightweight version of GPT Image. Faster and cheaper image generation, suited to prototyping.

Image Generation, Fast

DALL-E 3

OpenAI's classic image generation model. Creative, with a wide range of styles.

Image Generation, Creative

Sora 2

Video generation model from OpenAI. Creates high-quality video from a text prompt.

Video Generation, Text to Video

Sora 2 Pro

Sora 2 Pro. Highest video quality, higher resolution, longer duration.

Video Generation, Text to Video, Ultra HD

GPT Audio 1.5

The latest audio model. Understands and generates speech, translates, and summarizes audio.

Audio Input, Audio Output, Speech, Translation

GPT Audio

OpenAI's audio model. Understands and generates speech, and analyzes audio.

Audio Input, Audio Output, Speech

GPT Audio Mini

Lightweight audio model. Fast and cheap, suited to simple voice chat.

Audio Input, Audio Output, Speech

GPT Realtime 1.5

The latest realtime model. Two-way voice conversation with very low latency.

Realtime, Voice, Low Latency, Streaming

GPT Realtime

Realtime model. Two-way voice conversation in real time over WebSocket.

Realtime, Voice, Low Latency, Streaming

GPT Realtime Mini

Lightweight realtime model. Cheaper, suited to simple voice chat.

Realtime, Voice, Low Latency, Streaming

TTS-1

Text-to-Speech from OpenAI. 6 natural-sounding voices, with support for many languages.

Text to Speech, Multiple Voices, Multi-language

Whisper-1

Speech-to-Text from OpenAI. Accurate speech recognition, with support for 99+ languages.

Speech to Text, Transcription, Translation, Multi-language

Together Qwen3.5 397B

Qwen3.5 397B MoE on Together AI. Very strong open-source model, on par with GPT-5.

Chat, Reasoning, Code, Math, Streaming

Together Qwen3 Coder 480B

Qwen3 Coder 480B, the largest open-source code model. Excellent for programming.

Code, Chat, Reasoning, Streaming

Together Qwen3 Coder Next

Qwen3 Coder Next, the newest version. An upgrade over 480B with better code quality.

Code, Chat, Reasoning, Streaming

Together Qwen3 235B

Qwen3 235B MoE. Good balance of quality and price. Strong reasoning.

Chat, Reasoning, Code, Math, Streaming

Together DeepSeek R1

DeepSeek R1 on Together AI. Strong reasoning, competitive price, good speed.

Chat, Reasoning, Math, Code, Streaming

Together DeepSeek V3.1

DeepSeek V3.1 on Together AI. Upgrade over V3: stronger and faster.

Chat, Function Calling, Code, Streaming

Together Llama 4 Maverick

Meta Llama 4 Maverick MoE on Together. Strong, fast, with vision support.

Chat, Vision, Function Calling, Streaming

Together Cogito 671B

DeepCogito 671B. Strong reasoning model that competes with DeepSeek R1.

Chat, Reasoning, Code, Math, Streaming

Together MiniMax M2.5

MiniMax M2.5, a model from MiniMax. Strong at long context and reasoning.

Chat, Reasoning, Streaming

Together Kimi K2.5

Moonshot Kimi K2.5 on Together AI. Strong reasoning, good at code and math.

Chat, Reasoning, Code, Math, Streaming

Together Qwen3.5 9B

Qwen3.5 9B. Light, cheap, and fast. Suited to chatbots and simple tasks.

Together Llama 3.3 70B

Llama 3.3 70B on Together AI. Versatile, cheap, good speed.

Chat, Function Calling, Streaming

Together Apriel 1.6 15B Thinker

Free, New

ServiceNow Apriel 1.6 15B Thinker. Free reasoning model with good chain-of-thought support.

Chat, Reasoning, Streaming

Together Apriel 1.5 15B Thinker

Free, New

ServiceNow Apriel 1.5 15B Thinker. The previous version, with free reasoning.

Chat, Reasoning, Streaming

Together GLM-4.5 Air

Zhipu GLM-4.5 Air on Together AI. Cheap, with good Chinese and English support.

Chat, Code, Multi-language, Streaming

Together Qwen3 VL 8B

Qwen3 Vision-Language 8B. The cheapest vision model, with image analysis support.

Chat, Vision, Streaming
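Since the catalog invites comparing context windows, here is a small sketch that filters models by the context sizes quoted in the descriptions above. It assumes "K" means 1,000 tokens and "M" means 1,000,000 tokens, which the page does not specify, and it covers only a handful of entries; the function name is hypothetical.

```python
# Context sizes as quoted in the catalog descriptions.
# Assumption: 256K = 256_000 tokens and 1M = 1_000_000 tokens.
CONTEXT_WINDOWS = {
    "GPT-5.4": 256_000,
    "Claude Sonnet 4.6": 200_000,
    "Gemini 2.5 Pro": 1_000_000,
    "GPT-4.1": 1_000_000,
}


def models_that_fit(required_tokens: int) -> list[str]:
    """Return the names of models whose context window holds required_tokens."""
    return sorted(
        name
        for name, window in CONTEXT_WINDOWS.items()
        if window >= required_tokens
    )
```

For a 220,000-token input, Claude Sonnet 4.6 drops out of the candidate list while the 256K and 1M models remain.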

200+ Models via Pass-through

All providers

Beyond the 89 curated models above, you can call any model from the 11 providers using the format provider/model-id. Pricing is computed automatically from a markup.

OpenAI

20+ models

Anthropic

6+ models

Google

10+ models

DeepSeek

5+ models

Meta

10+ models

Groq

8+ models

Together AI

100+ models

Mistral

8+ models

Cohere

5+ models

Perplexity

4+ models

Fireworks

30+ models

OpenAI

gpt-4o-2024-08-06, gpt-4-turbo, o1-preview, o1-mini, o3-mini, gpt-4o-mini-2024-07-18, ...

Anthropic

claude-opus-4-6, claude-3-haiku-20240307, claude-3-opus-20240229, ...

Google

gemini-1.5-flash, gemini-pro, gemma-2-27b-it, ...

Mistral

mistral-large-latest, mistral-medium, codestral-latest, mistral-small-latest, ...

Together AI

Qwen/Qwen2.5-72B-Instruct, NousResearch/Hermes-3-Llama-3.1-405B, databricks/dbrx-instruct, ...

Cohere

command-r-plus, command-r, embed-english-v3.0, ...

Perplexity

sonar-pro, sonar, sonar-reasoning, ...

Fireworks

accounts/fireworks/models/llama-v3p1-405b-instruct, accounts/fireworks/models/mixtral-8x22b-instruct, ...

Example: calling any model

# OpenAI specific model
model="openai/gpt-4o-2024-08-06"

# Mistral Large
model="mistral/mistral-large-latest"

# Qwen 72B via Together AI
model="together_ai/Qwen/Qwen2.5-72B-Instruct"

# Perplexity online search
model="perplexity/sonar-pro"
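The provider/model-id strings above can be taken apart with a short helper. This sketch assumes the provider is everything before the first slash, which matches the together_ai example where the model ID itself contains a slash; the page does not state the parsing rule, and `split_model_id` is a hypothetical name, not part of any Futrix SDK.

```python
def split_model_id(model: str) -> tuple[str, str]:
    """Split "provider/model-id" at the first slash.

    Assumption: the provider never contains "/", while the model ID may
    (for example "Qwen/Qwen2.5-72B-Instruct").
    """
    provider, separator, model_id = model.partition("/")
    if not separator or not provider or not model_id:
        raise ValueError(f"expected 'provider/model-id', got {model!r}")
    return provider, model_id


# Usage with two of the strings shown above.
provider, model_id = split_model_id("together_ai/Qwen/Qwen2.5-72B-Instruct")
print(provider, model_id)
```

Splitting only at the first slash keeps nested IDs such as the Together AI and Fireworks paths intact.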