MODEL WEIGHTS
373 listings · open vs closed weights · readme & download links
Qwen3.7-Plus: Multimodal Agent Intelligence
closed · QwenTeam
Qwen3.7-Plus is a multimodal agent model that integrates vision and language capabilities into a single foundation. It excels in coding, tool use, and productivity workflows, offering a versatile solution for software engineering and automation tasks.
Grok Build 0.1
closed · xAI
Grok Build 0.1 is an intelligent coding model that powers the Grok Build CLI. It excels at agentic coding and is available via the xAI API in public beta.
Antigravity
closed
Antigravity is an agent-first AI development platform by Google designed for autonomous coding agents. It allows users to manage complex workflows, from coding to testing and debugging, all performed by multiple agents in parallel.
Claude Opus 4.8
closed · Anthropic
Claude Opus 4.8 introduces enhancements in coding, long-running agentic work, and complex knowledge tasks. It offers a fast mode for quicker outputs and a new effort dial for response customization.
Aleph 2.0
closed · Runway
Aleph 2.0 is an upgraded video editing model that allows users to modify video content efficiently. It enables users to edit a single frame and apply those changes across the entire video while preserving unaltered elements.
Qwen 3.7-Max
closed · QwenTeam
Qwen 3.7-Max is a proprietary model designed for the agent era, excelling in coding, office automation, and long-horizon reasoning tasks. It offers versatile capabilities for writing and debugging code, automating workflows, and executing complex tasks autonomously.
Composer 2.5
closed · Cursor
Composer 2.5 is a substantial improvement over its predecessor, offering enhanced intelligence and behavior for long-running tasks. It excels in following complex instructions and provides a more pleasant collaboration experience.
Gemini 3.5 Flash
closed · Google DeepMind
Gemini 3.5 Flash is designed for executing complex, agentic workflows with exceptional speed and intelligence. It excels in coding and long-horizon tasks, providing real-world utility for developers and enterprises.
Starchild-1: The First Real-Time Multimodal World Model
closed · Odyssey
Starchild-1 is the world's first multimodal world model that generates synchronized audio and video in real-time while responding to user input. It represents a significant advancement in generative intelligence by learning directly from the world through large-scale video.
Perception 1.0
closed · Ceptory
Perception 1.0 is the core model layer behind Ceptory's enterprise video intelligence, enabling natural language search, multimodal analysis, and operational monitoring. It provides structured outputs ready for API integration and supports retrieval from large video libraries.
MiniMax M2.5
closed · MiniMax
MiniMax M2.5 is a state-of-the-art model designed for real-world productivity, excelling in coding, agentic tool use, and office work. It offers significant improvements in task completion speed and cost-effectiveness, making it ideal for complex applications.
Grok 4.3
closed
Grok 4.3 is described by its developer as its most advanced flagship model, leading the industry in non-hallucination rate, agentic tool calling, and instruction following.
GPT-Realtime-2
closed · OpenAI
GPT-Realtime-2 is OpenAI's most intelligent voice model yet, featuring GPT-5-level reasoning and a 128,000-token context window. It enables real-time voice interactions and can handle complex conversations seamlessly.
ElevenAgents
closed · ElevenLabs
ElevenAgents is a multimodal AI agent that processes images, PDFs, audio messages, and more across various channels. It enables seamless interactions by maintaining full context during conversations.
Seed3D 2.0
closed · ByteDance Seed
Seed3D 2.0 is a next-generation 3D generative model that delivers high-quality, production-grade 3D content with enhanced geometric precision and material realism. It introduces a two-stage generation strategy for improved detail and a unified PBR model for superior texture generation.
Grok 4.3
closed
Grok 4.3 is a model designed for function calling, structured outputs, and reasoning. It can connect to external tools and systems and return organized responses.
Realtime TTS-2
closed · Inworld AI
Realtime TTS-2 is a new-generation voice model from Inworld AI designed for real-time conversation. It captures the user's tone, pacing, and emotional state, and provides a voice identity in more than 100 languages.
Seed3D 2.0
closed · ByteDance
Seed3D 2.0 is a model developed by ByteDance that generates 3D objects from a single image or a text prompt. It aims to enhance 3D creation workflows significantly.
Odyssey-2
closed · Odyssey
Odyssey-2 is a frontier world model that generates interactive AI video in real time. You can type prompts and watch as the video evolves instantly, creating a unique experience for each user.
Seed2.0
closed
Seed2.0 is a series of general-purpose agent models optimized for large-scale production deployment. The series enhances multimodal understanding and LLM capabilities, making it suitable for complex real-world tasks.
GPT-5.5
closed · OpenAI
GPT-5.5 is OpenAI's smartest and most intuitive model yet, designed to enhance productivity on a computer. It understands tasks faster and uses fewer tokens for the same tasks.
Seedance 2.0
closed · ByteDance
Seedance 2.0 is a multi-modal audio-video generation model that supports text, image, audio, and video inputs with improved generation quality and speed. It delivers substantial improvements across all key sub-dimensions of video and audio generation.
Claude Opus 4.7
closed · Anthropic
Claude Opus 4.7 is Anthropic's most capable generally available model, designed for complex reasoning and agentic coding. It supports text and image input, text output, and multilingual capabilities.
Gemini Robotics ER 1.6
closed
Gemini Robotics ER 1.6 is a vision-language model for robot reasoning. It handles spatial pointing, multi-view success detection, and instrument reading, making it ideal for robotics engineers and developers.