
MODEL WEIGHTS

568 listings · open vs closed weights · readme & download links

HY-World 2.0

open

Tencent-Hunyuan

HY-World 2.0 is a multi-modal world model framework for generating and reconstructing 3D worlds from various input modalities. It produces editable 3D assets that can be imported into game engines, offering capabilities for both world generation and reconstruction.

generative-media · ~1.2B
0 · 0 comments · weights link →

Seed3D 2.0

closed

ByteDance Seed

Seed3D 2.0 is a next-generation 3D generative model that delivers high-quality, production-grade 3D content with enhanced geometric precision and material realism. It introduces a two-stage generation strategy for improved detail and a unified PBR model for superior texture generation.

generative-media
0 · 0 comments

Grok 4.3

closed

Grok 4.3 is a model designed for function calling, structured outputs, and reasoning. It can connect to external tools and systems and return structured responses.

language · 1,000,000 ctx
0 · 0 comments

Realtime TTS-2

closed

Inworld AI

Realtime TTS-2 is a new-generation voice model from Inworld AI designed for real-time conversation. It captures the user's tone, pacing, and emotional state, and provides a voice identity across more than 100 languages.

voice
0 · 0 comments

Seed3D 2.0

closed

ByteDance

Seed3D 2.0 is a model developed by ByteDance that generates 3D objects from a single image or a text prompt. It aims to enhance 3D creation workflows significantly.

generative-media
0 · 0 comments

Mistral Medium 3.5

open

Mistral AI

Mistral Medium 3.5 is a flagship model designed for instruction-following, reasoning, and coding tasks. It operates as a dense 128B model with a 256k context window, enabling efficient performance in real-world applications.

language · 128B · 256,000 ctx
1 · 0 comments · weights link →

VibeVoice

open

Microsoft

VibeVoice is a family of open-source frontier voice AI models that includes both Text-to-Speech (TTS) and Automatic Speech Recognition (ASR) models. It supports long-form audio processing and multilingual capabilities.

speech · 64,000 ctx
0 · 0 comments

ACE-Step 1.5

open

ACE Music

ACE-Step 1.5 is a highly efficient open-source music foundation model that delivers commercial-grade music generation on consumer hardware. It supports lightweight personalization and runs locally with less than 4GB of VRAM.

generative-media · 4B
0 · 0 comments · weights link →

Odyssey-2

closed

Odyssey

Odyssey-2 is a frontier world model that generates interactive AI video in real time. Users type prompts and watch the video evolve as it is generated, so each session produces a different result.

generative-media
0 · 0 comments

DeepSeek V4

open

DeepSeek, Inc.

DeepSeek V4 is an open-source model offering cost-effective 1M context length with enhanced agentic capabilities and world-class reasoning. It includes two variants: V4-Pro and V4-Flash, catering to different performance needs.

language · 1.6T / 284B · 1,000,000 ctx
0 · 0 comments · weights link →

Wan2.1

open

Wan-Video

Wan2.1 is an open suite of video foundation models that excels in video generation tasks including Text-to-Video, Image-to-Video, and Video Editing. It is designed to perform efficiently on consumer-grade GPUs while delivering state-of-the-art performance.

generative-media · 14B
0 · 0 comments · weights link →

Seed2.0

closed

Seed2.0 is a series of general-purpose agent models optimized for large-scale production deployment. It enhances multimodal understanding and LLM capabilities, making it suitable for complex real-world tasks.

multimodal
0 · 0 comments

GPT-5.5

closed

OpenAI

GPT-5.5 is described by OpenAI as its smartest and most intuitive model yet, designed to enhance productivity on a computer. According to OpenAI, it understands tasks faster and uses fewer tokens for the same tasks.

language
0 · 0 comments

Wan 2.7

open

Alibaba Cloud

Wan 2.7 is an advanced AI model for video editing and image generation, allowing users to create and customize visuals with text prompts and multi-image guidance. It supports long-form text generation in multiple languages and offers precise control over color and image editing.

generative-media
0 · 0 comments

VOID: Video Object and Interaction Deletion

open

Netflix

VOID removes objects from videos along with all interactions they induce on the scene. It handles not just secondary effects like shadows and reflections, but also physical interactions like objects falling when a person is removed.

video-to-video · 5B
0 · 0 comments · weights link →

Seedance 2.0

closed

ByteDance

Seedance 2.0 is a multi-modal audio-video generation model that supports text, image, audio, and video inputs with improved generation quality and speed. It delivers substantial improvements across all key sub-dimensions of video and audio generation.

generative-media
0 · 0 comments

Claude Opus 4.7

closed

Anthropic

Claude Opus 4.7 is Anthropic's most capable generally available model, designed for complex reasoning and agentic coding. It supports text and image input, text output, and multilingual capabilities.

language · 1,000,000 ctx
0 · 0 comments

Gemini Robotics ER 1.6

closed

Google

Gemini Robotics ER 1.6 is a vision-language model for robot reasoning. It handles spatial pointing, multi-view success detection, and instrument reading, making it ideal for robotics engineers and developers.

vision-language
0 · 0 comments

Claude Mythos Preview

closed

Anthropic

Claude Mythos Preview is a general-purpose frontier model developed by Anthropic, designed to identify and exploit software vulnerabilities. It showcases advanced coding capabilities that surpass traditional methods of vulnerability detection.

code
0 · 0 comments · weights link →

GLM-5.1

open

Z.ai

GLM-5.1 is a next-generation flagship model for agentic engineering, offering significantly stronger coding capabilities than its predecessor. It excels in handling ambiguous problems and sustains optimization over extended sessions.

code · 754B
0 · 0 comments · weights link →

wan2.5-t2i-preview

closed

Alibaba

Alibaba · 1 Arena leaderboard

generative-media
0 · 0 comments

wan2.6-i2v

closed

Alibaba

Alibaba · 1 Arena leaderboard

generative-media
0 · 0 comments

wan2.7-image

closed

Alibaba

Alibaba · 2 Arena leaderboards

generative-media
0 · 0 comments

wizardlm-13b

open

Microsoft

Microsoft · 1 Arena leaderboard

language · 13B
0 · 0 comments · weights link →