Model Landscape — June 2026

Purpose: Comprehensive reference of frontier LLM models, their specs, pricing, modalities, and best-fit use cases for building a model-agnostic agent harness.

Date researched: 2026-06-23

Source: OpenRouter model pages (live data)

1. What Was Researched

A curated set of 25 models across 11 providers was profiled using OpenRouter's standardized cost/benchmark data. Models were selected to represent the full spectrum needed by a multi-model agent harness:

Frontier reasoning models (primary agent loop)
Coding-specialized models (subagent coding tasks)
Fast/flash models (high-throughput, low-latency tier)
Nano/mini models (cost-efficient inner loops, classification)
Audio/voice models (voice integration)
Embedding models (RAG, semantic search, memory)
Reranking models (retrieval pipeline optimization)

2. Sources

All data sourced from OpenRouter model pages on 2026-06-23:

#	URL	Model
1	https://openrouter.ai/x-ai/grok-4.3	Grok 4.3
2	https://openrouter.ai/z-ai/glm-5.2	GLM 5.2
3	https://openrouter.ai/moonshotai/kimi-k2.7-code	Kimi K2.7 Code
4	https://openrouter.ai/anthropic/claude-fable-5	Claude Fable 5
5	https://openrouter.ai/nvidia/nemotron-3-ultra-550b-a55b	Nemotron 3 Ultra
6	https://openrouter.ai/qwen/qwen3.7-plus	Qwen 3.7 Plus
7	https://openrouter.ai/minimax/minimax-m3	MiniMax M3
8	https://openrouter.ai/stepfun/step-3.7-flash	Step 3.7 Flash
9	https://openrouter.ai/anthropic/claude-opus-4.8	Claude Opus 4.8
10	https://openrouter.ai/qwen/qwen3.7-max	Qwen 3.7 Max
11	https://openrouter.ai/google/gemini-3.5-flash	Gemini 3.5 Flash
12	https://openrouter.ai/x-ai/grok-voice-tts-1.0	Grok Voice TTS 1.0
13	https://openrouter.ai/openai/gpt-5.4-mini	GPT-5.4 Mini
14	https://openrouter.ai/openai/gpt-5.5	GPT-5.5
15	https://openrouter.ai/openai/gpt-5.4-nano	GPT-5.4 Nano
16	https://openrouter.ai/openai/gpt-audio-mini	GPT Audio Mini
17	https://openrouter.ai/openai/gpt-audio	GPT Audio
18	https://openrouter.ai/cohere/rerank-4-pro	Rerank 4 Pro
19	https://openrouter.ai/cohere/rerank-4-fast	Rerank 4 Fast
20	https://openrouter.ai/cohere/rerank-v3.5	Rerank v3.5
21	https://openrouter.ai/google/gemini-embedding-2	Gemini Embedding 2
22	https://openrouter.ai/openai/text-embedding-3-large	Text Embedding 3 Large
23	https://openrouter.ai/openai/text-embedding-3-small	Text Embedding 3 Small
24	https://openrouter.ai/deepseek/deepseek-v4-pro	DeepSeek V4 Pro
25	https://openrouter.ai/deepseek/deepseek-v4-flash	DeepSeek V4 Flash

3. Model Profiles — Frontier Reasoning Models

3.1 xAI Grok 4.3

Attribute	Value
API ID	`x-ai/grok-4.3`
Provider	xAI
Type	Reasoning model
Modalities	Text + Image → Text
Context Window	1,000,000 tokens
Max Output	Unlimited
Input Price	$1.25 / 1M tokens
Output Price	$2.50 / 1M tokens
Released	Apr 30, 2026
Reasoning Config	none / low / medium / high (default: low)
Tiered Pricing	Requests >200K total tokens billed at higher rate

Description: Reasoning model suited for agentic workflows, instruction-following tasks, and applications requiring high factual accuracy. Supports configurable reasoning effort levels. 1M context with no output limit makes it excellent for long-document analysis, deep research, and multi-step agentic tasks.