Introduction
LongCat-Flash-Chat is an open-source large language model (LLM) released by Meituan. It uses a Mixture-of-Experts (MoE) architecture with dynamic per-token computation, targeting efficient, low-cost inference.
Key Features
- Zero-Computation Experts mechanism: tokens can be routed to experts that perform no computation, reducing inference cost
- Strong performance on agentic and multi-agent tasks under high-concurrency serving
- PID-controller-based tuning of expert bias, keeping the per-token compute budget stable while allowing flexible resource allocation
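The PID-controlled expert bias can be sketched in a few lines. This is a minimal illustration with hypothetical names and gains, not the released implementation: a controller observes the fraction of tokens routed to compute-bearing (non-zero) experts and nudges a bias term on the router so that fraction tracks a target budget.

```python
# Hypothetical sketch of PID-based expert-bias tuning.
# All names (ExpertBiasPID, gains, target) are illustrative assumptions.

class ExpertBiasPID:
    def __init__(self, target, kp=0.5, ki=0.1, kd=0.0):
        self.target = target          # desired activation rate, e.g. 0.05
        self.kp, self.ki, self.kd = kp, ki, kd
        self.integral = 0.0
        self.prev_error = 0.0
        self.bias = 0.0               # bias added to router logits

    def update(self, observed_rate):
        """One control step: move the bias so observed_rate -> target."""
        error = observed_rate - self.target
        self.integral += error
        derivative = error - self.prev_error
        self.prev_error = error
        # Positive error (too many heavy experts chosen) increases the bias
        # toward zero-computation experts, lowering activated parameters.
        self.bias += (self.kp * error
                      + self.ki * self.integral
                      + self.kd * derivative)
        return self.bias
```

In use, the controller would be stepped once per batch: if the observed activation rate stays above target, the bias keeps growing until routing rebalances.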
Use Cases
- Agent-based dialogue and complex reasoning tasks
- Enterprise-level QA and multi-scenario applications
- Efficient inference and cost-sensitive deployments
Technical Highlights
- 560B total parameters; dynamically activates 18.6B–31.3B parameters per token
- Deployment supported via SGLang and vLLM
- MIT-licensed; permits model distillation and transfer learning
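A quick back-of-envelope check makes the sparsity implied by the figures above concrete: of the 560B total parameters, only 18.6B–31.3B are active for any given token.

```python
# Fraction of parameters activated per token, from the stated figures.
TOTAL_B = 560.0               # total parameters, in billions
ACT_MIN_B, ACT_MAX_B = 18.6, 31.3  # activated range, in billions

min_ratio = ACT_MIN_B / TOTAL_B    # lower bound of active fraction
max_ratio = ACT_MAX_B / TOTAL_B    # upper bound of active fraction
print(f"active fraction: {min_ratio:.1%} to {max_ratio:.1%}")
```

So roughly 3–6% of the model's weights participate in each forward pass, which is where the low inference cost comes from.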