LongCat-Flash-Chat

Discover LongCat-Flash-Chat, an open-source LLM by Meituan, featuring innovative architecture for efficient, cost-effective AI dialogue and reasoning tasks.

Meituan · Since 2025-08-30

Introduction

LongCat-Flash-Chat is an open-source LLM from Meituan. It uses a Mixture-of-Experts (MoE) architecture that varies the amount of computation spent on each token, which keeps inference efficient and inexpensive.

Key Features

Innovative Zero-Computation Experts mechanism for superior inference efficiency
High concurrency and multi-agent task support with outstanding performance
PID controller for expert bias fine-tuning and flexible resource allocation
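
The first and third features can be illustrated together with a toy simulation. This is a minimal sketch under stated assumptions, not Meituan's implementation: the expert counts, top-k value, target, PID gains, and every function name below are hypothetical, and router scores are random numbers rather than learned values. A zero-computation expert simply returns its input, so selecting it costs no FFN work; a PID controller then adjusts the router bias of those experts until the average number of real FFN experts used per token settles near a target.

```python
import random

NUM_FFN = 4        # hypothetical number of ordinary FFN experts
NUM_ZERO = 2       # hypothetical number of zero-computation experts
TOP_K = 3          # experts selected per token (assumption)
TARGET_FFN = 1.5   # desired average FFN experts per token (assumption)


def expert_output(index, x):
    """Zero-computation experts return the input; others do stand-in work."""
    if index >= NUM_FFN:
        return list(x)                      # identity, no FFN cost
    return [(index + 1) * v for v in x]     # placeholder for a real FFN


def select_experts(scores, bias, k=TOP_K):
    """Return the indices of the k experts with the highest biased score."""
    order = sorted(range(len(scores)),
                   key=lambda i: scores[i] + bias[i], reverse=True)
    return order[:k]


class PID:
    """Textbook discrete PID controller."""

    def __init__(self, kp, ki, kd):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.integral = 0.0
        self.prev_error = 0.0

    def step(self, error):
        self.integral += error
        derivative = error - self.prev_error
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative


def simulate(steps=200, batch=256, seed=0):
    """Route random tokens and let the PID loop tune the zero-expert bias."""
    rng = random.Random(seed)
    n = NUM_FFN + NUM_ZERO
    bias = [0.0] * n
    pid = PID(kp=0.1, ki=0.5, kd=0.0)       # gains chosen for illustration
    history = []
    for _ in range(steps):
        ffn_used = 0
        for _ in range(batch):
            scores = [rng.gauss(0.0, 1.0) for _ in range(n)]
            chosen = select_experts(scores, bias)
            ffn_used += sum(1 for i in chosen if i < NUM_FFN)
        avg = ffn_used / batch
        history.append(avg)
        # Positive error means too many FFN experts were used, so the
        # zero-computation experts get a larger bias on the next step.
        zero_bias = pid.step(avg - TARGET_FFN)
        for i in range(NUM_FFN, n):
            bias[i] = zero_bias
    tail = history[-50:]
    return sum(tail) / len(tail), bias


if __name__ == "__main__":
    avg_ffn, final_bias = simulate()
    print("average FFN experts per token:", round(avg_ffn, 3))
    print("zero-expert bias:", round(final_bias[NUM_FFN], 3))
```

In this toy setup the unbiased router would use about two FFN experts per token, so the controller has to push the zero-expert bias upward to reach the lower target. Only the zero-computation experts are biased here, which is a simplification chosen to keep the control loop easy to follow.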

Use Cases

Agent-based dialogue and complex reasoning tasks
Enterprise-level QA and multi-scenario applications
Efficient inference and cost-sensitive deployments

Technical Highlights

560B total parameters, of which 18.6B to 31.3B are activated per token depending on context
Supports SGLang/vLLM deployment solutions
MIT License, supports model distillation and transfer learning
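
To put the parameter figures above in proportion, the short calculation below works out what share of the model is active per token and how much memory the full weights would occupy. It is a rough sketch: the parameter counts come from the list above, while the 2 bytes per parameter (16-bit weights) and the function names are assumptions made for illustration.

```python
TOTAL_PARAMS_B = 560.0   # total parameters, in billions (from the list above)
ACTIVE_MIN_B = 18.6      # fewest activated parameters per token, in billions
ACTIVE_MAX_B = 31.3      # most activated parameters per token, in billions
BYTES_PER_PARAM = 2      # assumption: weights stored in a 16-bit format


def active_fraction(active_b, total_b=TOTAL_PARAMS_B):
    """Share of all parameters that take part in processing one token."""
    return active_b / total_b


def weight_memory_gb(params_b, bytes_per_param=BYTES_PER_PARAM):
    """Weight storage in GB (10^9 bytes): billions of params times bytes each."""
    return params_b * bytes_per_param


if __name__ == "__main__":
    low = active_fraction(ACTIVE_MIN_B)
    high = active_fraction(ACTIVE_MAX_B)
    print(f"active share per token: {low:.1%} to {high:.1%}")   # 3.3% to 5.6%
    print(f"full weights: {weight_memory_gb(TOTAL_PARAMS_B):.0f} GB")  # 1120 GB
```

The contrast is the point of the MoE design: all 560B parameters must be stored and served, but each token only pays the compute cost of a few percent of them.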

Related Resources

AutoSubs

Axolotl

Cactus