FAST: Efficient Action Tokenization for Vision-Language-Action Models - AI Robotics Tool

Overview

FAST is a universal action tokenizer that maps robot action sequences into dense, discrete tokens for training autoregressive vision-language-action models. The project provides a pre-trained tokenizer and tools to train a custom tokenizer on your own action data, published as a Hugging Face repository for robotics research and development.

Key Features

  • Universal action tokenizer for robot action sequences
  • Produces dense, discrete tokens for autoregressive models
  • Supports using a provided pre-trained tokenizer
  • Allows custom tokenizer training on user datasets
  • Hosted as a Hugging Face repository for easy access

Ideal Use Cases

  • Training autoregressive vision-language-action models
  • Tokenizing robotic action sequences for model inputs
  • Creating custom action vocabularies from collected data
  • Research in robotics, imitation, and language-conditioned control

Getting Started

  • Open the FAST repository on Hugging Face.
  • Clone or download the repository to your workspace.
  • Install the repository dependencies per instructions.
  • Load the provided pre-trained tokenizer for evaluation.
  • Prepare your action sequence dataset for custom training.
  • Run the repository's custom-training tools with your data.

Pricing

No pricing information disclosed for this repository or tools.

Key Information

  • Category: Robotics
  • Type: AI Robotics Tool