Yi - AI Language Models Tool
Overview
Yi is an open-source family of bilingual (English/Chinese) large language models developed by 01.AI. According to the project's GitHub repository, the series was trained from scratch on a multi-trillion-token corpus and includes base and chat variants at several sizes (notably 6B, 9B and 34B parameters), long-context variants (32K and 200K), vision-language members (Yi-VL), and quantized releases for efficient inference. The team's technical report on arXiv describes the family, its 3.1 trillion-token pretraining corpus, the extension of context length to 200K tokens, and the data-quality engineering to which the authors attribute the models' performance (Yi: Open Foundation Models by 01.AI, arXiv:2403.04652).
At release, Yi-34B drew attention for strong benchmark results: the instruction-tuned Yi-34B-Chat was reported to place second on AlpacaEval (behind GPT-4 Turbo), and the Yi-34B base model ranked highly on open-model leaderboards. The code and many weights are available under Apache 2.0 from the project's GitHub and Hugging Face model pages, and an active downstream ecosystem provides quantized variants (GPTQ/AWQ/GGUF) and hosted endpoints on third-party services such as Hugging Face, Replicate and other vendors. Typical uses include bilingual assistants, code and math tasks, long-document retrieval, and multimodal (vision + text) workloads; hardware requirements scale from a single high-memory GPU for the 6B models up to multi-GPU nodes for the 34B and 200K variants (see the official repo and tech report for exact recommendations).
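Because the weights are distributed through Hugging Face model pages, a common first step is pulling a snapshot locally. A minimal sketch using huggingface_hub, where the choice of the 6B checkpoint, the local_dir path, and the disk-size estimate are illustrative assumptions rather than official figures:
from huggingface_hub import snapshot_download
# Download a full model snapshot for local / offline use.
# Note: a 6B fp16 checkpoint is roughly 12 GB on disk; swap repo_id for Yi-6B-Chat, Yi-34B, etc.
local_path = snapshot_download(repo_id="01-ai/Yi-6B", local_dir="./yi-6b")
print("Model files downloaded to:", local_path)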
GitHub Statistics
- Stars: 7,842
- Forks: 489
- Contributors: 44
- License: Apache-2.0
- Primary Language: Jupyter Notebook
- Last Updated: 2024-11-27T09:25:55Z
Key Features
- Bilingual modeling with strong English and Chinese performance (trained on a large English/Chinese corpus).
- Multiple sizes: a consumer/edge-friendly 6B model alongside higher-capability 9B and 34B production variants.
- Long-context variants: a 4K default context (extendable to 32K) plus dedicated 200K-context models for large documents.
- Open-source release under Apache 2.0; many weights and code available on GitHub and Hugging Face.
- Chat (instruction-tuned) and base checkpoints, with SFT chat variants optimized for conversational use.
- Quantized distributions (GPTQ / AWQ / GGUF) and community forks for lower-cost inference (see the sketch after this list).
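As a rough illustration of the lower-cost inference path, the sketch below loads the 6B base model with on-the-fly 4-bit quantization via bitsandbytes, a transformers-level alternative to downloading a pre-quantized GPTQ/AWQ/GGUF build; the memory figures and package requirements are assumptions, not numbers from the Yi repo:
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
# 4-bit NF4 weights with fp16 compute; requires a CUDA GPU and the bitsandbytes package
quant_config = BitsAndBytesConfig(load_in_4bit=True, bnb_4bit_compute_dtype=torch.float16)
model = AutoModelForCausalLM.from_pretrained(
    "01-ai/Yi-6B",
    device_map="auto",
    quantization_config=quant_config,
)
tokenizer = AutoTokenizer.from_pretrained("01-ai/Yi-6B", use_fast=False)
The same pattern applies to the larger checkpoints, though a 4-bit 34B model still needs on the order of 20 GB of GPU memory.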
Example Usage
Example (python):
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

# Example: run a small Yi model (change to '01-ai/Yi-34B' on multi-GPU / high-memory hosts)
model_id = "01-ai/Yi-6B"

# Load the slow (SentencePiece) tokenizer and the model in fp16, sharded across available devices
tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=False)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto", torch_dtype=torch.float16)

prompt = "Write a short bilingual (English + Chinese) greeting and list three recommended first steps for evaluating a new LLM."

# Tokenize, move the inputs to the model's device, and sample a completion
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
generated = model.generate(**inputs, max_new_tokens=180, do_sample=True, temperature=0.8)
output = tokenizer.decode(generated[0], skip_special_tokens=True)
print(output)
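For the instruction-tuned checkpoints, a chat-template flow is closer to intended use. A hedged sketch, assuming the 01-ai/Yi-6B-Chat weights and that their tokenizer ships a chat template usable via transformers' apply_chat_template:
from transformers import AutoTokenizer, AutoModelForCausalLM
import torch

chat_id = "01-ai/Yi-6B-Chat"
chat_tokenizer = AutoTokenizer.from_pretrained(chat_id, use_fast=False)
chat_model = AutoModelForCausalLM.from_pretrained(chat_id, device_map="auto", torch_dtype=torch.float16)

# Build the prompt from a message list using the model's built-in chat template
messages = [{"role": "user", "content": "Give me one greeting in English and one in Chinese."}]
input_ids = chat_tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(chat_model.device)

reply_ids = chat_model.generate(input_ids, max_new_tokens=128, do_sample=True, temperature=0.7)
# Decode only the newly generated tokens, dropping the echoed prompt
print(chat_tokenizer.decode(reply_ids[0][input_ids.shape[-1]:], skip_special_tokens=True))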
Benchmarks
- Parameter count (Yi-34B): ≈34.4B parameters (Source: https://huggingface.co/01-ai/Yi-34B)
- Pretraining corpus size: ≈3.1 trillion tokens (English + Chinese) (Source: https://arxiv.org/abs/2403.04652)
- Context length: 4K default, expandable to 32K; dedicated 200K variants available (Source: https://github.com/01-ai/Yi)
- AlpacaEval (instruction/chat ranking): Yi-34B-Chat reported 2nd place, behind GPT-4 Turbo (Jan 2024 data) (Source: https://github.com/01-ai/Yi)
- Open-model leaderboard placement (at release): Yi-34B reported top-ranked among open-source models on the Hugging Face Open LLM Leaderboard (Nov 2023–Jan 2024) (Source: https://www.wired.com/story/chinese-startup-01-ai-is-winning-the-open-source-ai-race)
Key Information
- Category: Language Models
- Type: AI Language Models Tool