llms / directory
MODEL WEIGHTS▌
568listings · open vs closed weights · readme & download links
HY-World 2.0
openTencent-Hunyuan
HY-World 2.0 is a multi-modal world model framework for generating and reconstructing 3D worlds from various input modalities. It produces editable 3D assets that can be imported into game engines, offering capabilities for both world generation and reconstruction.
Seed3D 2.0
closedByteDance Seed
Seed3D 2.0 is a next-generation 3D generative model that delivers high-quality, production-grade 3D content with enhanced geometric precision and material realism. It introduces a two-stage generation strategy for improved detail and a unified PBR model for superior texture generation.
Grok 4.3
closedGrok 4.3 is a powerful model designed for function calling, structured outputs, and reasoning capabilities. It can connect to external tools and systems, providing organized responses.
Realtime TTS-2
closedInworld AI
Realtime TTS-2 is a new generation voice model from Inworld AI designed for real-time conversation. It captures the user's tone, pacing, and emotional state, providing a voice identity across over 100 languages.
Seed3D 2.0
closedByteDance
Seed3D 2.0 is a model developed by ByteDance that generates 3D objects from a single image or a text prompt. It aims to enhance 3D creation workflows significantly.
Mistral Medium 3.5
openMistral AI
Mistral Medium 3.5 is a flagship model designed for instruction-following, reasoning, and coding tasks. It operates as a dense 128B model with a 256k context window, enabling efficient performance in real-world applications.
VibeVoice
openMicrosoft
VibeVoice is a family of open-source frontier voice AI models that includes both Text-to-Speech (TTS) and Automatic Speech Recognition (ASR) models. It supports long-form audio processing and multilingual capabilities.
ACE-Step 1.5
openACE Music
ACE-Step 1.5 is a highly efficient open-source music foundation model that delivers commercial-grade music generation on consumer hardware. It supports lightweight personalization and runs locally with less than 4GB of VRAM.
Odyssey-2
closedOdyssey
Odyssey-2 is a frontier world model that generates interactive AI video in real time. You can type prompts and watch as the video evolves instantly, creating a unique experience for each user.
DeepSeek V4
openDeepSeek, Inc.
DeepSeek V4 is an open-source model offering cost-effective 1M context length with enhanced agentic capabilities and world-class reasoning. It includes two variants: V4-Pro and V4-Flash, catering to different performance needs.
Wan2.1
openWan-Video
Wan2.1 is an open suite of video foundation models that excels in video generation tasks including Text-to-Video, Image-to-Video, and Video Editing. It is designed to perform efficiently on consumer-grade GPUs while delivering state-of-the-art performance.
Seed2.0
closedSeed2.0 is a series of general-purpose agent models optimized for large-scale production deployment. It enhances multimodal understanding and LLM capabilities, making it suitable for complex real-world tasks.
GPT-5.5
closedOpenAI
GPT-5.5 is our smartest and most intuitive model yet, designed to enhance productivity on a computer. It understands tasks faster and uses fewer tokens for the same tasks, making it more efficient and capable.
Wan 2.7
openAlibaba Cloud
Wan 2.7 is an advanced AI model for video editing and image generation, allowing users to create and customize visuals with text prompts and multi-image guidance. It supports long-form text generation in multiple languages and offers precise control over color and image editing.
VOID: Video Object and Interaction Deletion
openNetflix
VOID removes objects from videos along with all interactions they induce on the scene. It handles not just secondary effects like shadows and reflections, but also physical interactions like objects falling when a person is removed.
Seedance 2.0
closedByteDance
Seedance 2.0 is a multi-modal audio-video generation model that supports text, image, audio, and video inputs with improved generation quality and speed. It delivers substantial improvements across all key sub-dimensions of video and audio generation.
Claude Opus 4.7
closedAnthropic
Claude Opus 4.7 is Anthropic's most capable generally available model, designed for complex reasoning and agentic coding. It supports text and image input, text output, and multilingual capabilities.
Gemini Robotics ER 1.6
closedGemini Robotics ER 1.6 is a vision-language model for robot reasoning. It handles spatial pointing, multi-view success detection, and instrument reading, making it ideal for robotics engineers and developers.
Claude Mythos Preview
closedAnthropic
Claude Mythos Preview is a general-purpose frontier model developed by Anthropic, designed to identify and exploit software vulnerabilities. It showcases advanced coding capabilities that surpass traditional methods of vulnerability detection.
GLM-5.1
openZ.ai
GLM-5.1 is a next-generation flagship model for agentic engineering, offering significantly stronger coding capabilities than its predecessor. It excels in handling ambiguous problems and sustains optimization over extended sessions.
wan2.5-t2i-preview
closedAlibaba
Alibaba · 1 Arena leaderboard
wan2.6-i2v
closedAlibaba
Alibaba · 1 Arena leaderboard
wan2.7-image
closedAlibaba
Alibaba · 2 Arena leaderboards
wizardlm-13b
openMicrosoft
Microsoft · 1 Arena leaderboard