AngelSlim

AngelSlim is a model compression toolkit from Tencent providing easy-to-use compression and quantization workflows for efficient deployment.

Tencent · Since 2025-07-04

Detailed Introduction

AngelSlim is a model compression toolkit developed by Tencent. It provides a set of practical compression, quantization, and inference acceleration tools designed to make model deployment more efficient and reproducible for engineering teams.

Main Features

  • Multiple compression strategies and quantization techniques.
  • Optimized inference workflows and deployment guides.
  • An engineering focus on usability and production readiness.
  • Comprehensive documentation and examples for quick onboarding.

Use Cases

AngelSlim is suited to deploying large models where compute is constrained or costly, such as edge devices, production inference services, and cost-sensitive applications.

Technical Features

AngelSlim focuses on combining compression algorithms with inference efficiency, offering quantization, pruning, and graph optimization techniques alongside reproducible engineering workflows.
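
To make two of the techniques named above concrete, here is a minimal, library-free Python sketch of symmetric int8 quantization and magnitude pruning on a toy weight list. It does not use or represent AngelSlim's API; the function names (quantize_int8, dequantize, prune_by_magnitude) and the sample weights are hypothetical and chosen purely for illustration.

```python
def quantize_int8(weights):
    """Symmetric per-tensor quantization: w is approximated by scale * q,
    with q an integer in [-127, 127]. (Illustrative helper, not a library call.)"""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 127 if max_abs > 0 else 1.0
    q = [max(-127, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Map the integer codes back to approximate float weights."""
    return [v * scale for v in q]


def prune_by_magnitude(weights, sparsity):
    """Zero out the `sparsity` fraction of weights with the smallest magnitude."""
    k = int(len(weights) * sparsity)
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    drop = set(order[:k])
    return [0.0 if i in drop else w for i, w in enumerate(weights)]


# Toy data (assumed values, not taken from any real model).
weights = [0.50, -1.27, 0.03, 0.80, -0.02, 0.10]

q, scale = quantize_int8(weights)
restored = dequantize(q, scale)
pruned = prune_by_magnitude(weights, 0.5)

print(q)       # integer codes that would be stored in 8 bits each
print(pruned)  # half of the weights set to zero
```

In this sketch the rounding error of each restored weight is bounded by half the scale, and pruning trades accuracy for sparsity by discarding the weights that contribute least by magnitude. Production toolkits typically use finer-grained schemes (per-channel or per-group scales, calibration data), which this example leaves out.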

⚡ Optimization 🏗️ Model 🔮 Inference