Text Generation Inference
A toolkit for serving and deploying large language models (LLMs) for text generation via Rust, Python, and gRPC. It is optimized for inference and supports tensor parallelism for efficient scaling.
Key Information
- Category: Developer Tools
- Source: Github
- Tags: Python
- Last updated: January 09, 2026
Structured Metrics
No structured metrics captured yet.
Links
Canonical source: https://github.com/huggingface/text-generation-inference