Best AI SDKs & Libraries Tools
Explore 37 AI sdks & libraries tools to find the perfect solution.
SDKs & Libraries
37 toolsDiffusers (Hugging Face)
Open‑source library for state‑of‑the‑art pretrained diffusion models for images, audio, and video, covering training and optimized inference.
Hugging Face Chat UI
An open-source codebase that powers customizable chat user interfaces, allowing deployment of chatbot instances integrated with various supported language models.
LangChain
A comprehensive framework for building context-aware applications powered by large language models, featuring standard interfaces for models, embeddings, and vector stores.
Dynamic Speculation
A novel method developed by Intel labs and Hugging Face that accelerates text generation by up to 2.7x using dynamic speculation lookahead in language models, integrated into the Transformers library.
Hugging Face Transformers
A comprehensive library of pretrained models for text, vision, audio, video, and multimodal tasks, enabling fine-tuning and inference across many generative AI use cases.
AI SDK
AI SDK is a free, open-source TypeScript toolkit that helps developers build AI-powered applications and agents using frameworks such as Next.js, React, Svelte, and Vue, as well as Node.js runtime. It provides a unified API to interact with various model providers like OpenAI, Anthropic, and Google.
crewAI Tools
An open-source toolkit that provides a comprehensive guide and pre-built modules for integrating and creating custom tools for CrewAI agents. It includes implementations for file operations, web scraping, database interactions, API integrations, and AI-powered functionalities, supporting developers in enhancing AI agent capabilities.
OpenAI Realtime Agents
A demonstration repository showcasing advanced, agentic patterns built on top of OpenAI's Realtime API. It provides a Next.js/TypeScript example for prototyping multi-agent realtime voice applications, including sequential agent handoffs and state machine based interactions.
Mem0
A memory management tool for AI agents that adds an intelligent memory layer enabling personalized interactions, featuring secure and scalable long-term memory management.
Chainlit
Chainlit is an open‐source Python framework that enables developers to build production-ready conversational AI applications quickly. It provides a user-friendly interface, optimized step functions, and seamless integration with LLM tools, making it easier to create interactive chatbot experiences.
Unsloth
Unsloth is an open-source tool that enables developers to finetune various large language models (such as Llama 4, DeepSeek-R1, Gemma 3, and others) more efficiently. It offers free notebooks, reduced memory usage through dynamic quantization, and faster training performance, making it easier to deploy optimized models to platforms like GGUF, Ollama, vLLM, and Hugging Face.
Coqui TTS
A deep learning toolkit for advanced Text-to-Speech generation, providing pretrained models across 1100+ languages, tools for training and fine-tuning models, and utilities for dataset analysis. Battle-tested in both research and production environments.
Hugging Face Speech-to-Speech
An open-sourced, modular speech-to-speech pipeline developed by Hugging Face that integrates Voice Activity Detection, Speech-to-Text, Language Models, and Text-to-Speech. It leverages models from the Transformers library (e.g., Whisper, Parler-TTS) and supports various deployment approaches including server/client and local setups.
Generative AI Toolkit
An open source toolkit by AWS Labs that includes a tool registry and decorators, aimed at building generative AI applications.
NVIDIA NeMo
A scalable generative AI framework for building large language, multimodal, and speech AI models with various optimizations.
Vercel AI SDK
TypeScript toolkit for building AI apps and agents with streaming UIs, tool use, and provider integrations.
MCP Calculator
Xiaozhi MCP sample program is an open-source project demonstrating the Model Context Protocol (MCP), which enables language models to invoke external tools such as calculators, email operations, knowledge search, and more. It features secure WebSocket communication, real-time streaming, automatic reconnection, and a simple interface for tool creation.
Stability AI Generative Models
Open-source generative models and tooling from Stability AI (e.g., Stable Video 4D 2.0, image/video generation code).
spaCy Models
A GitHub repository by explosion that distributes pre-trained model packages for the spaCy NLP library. The repository provides model releases in .whl and .tar.gz formats for various NLP tasks (e.g., tagging, parsing, lemmatization, and named entity recognition) along with versioning and compatibility guidelines.
Tesseract OCR
Tesseract OCR is an open-source optical character recognition engine that can recognize text from images. It supports over 100 languages, multiple image formats (PNG, JPEG, TIFF), and offers both an LSTM-based OCR engine and a legacy mode for character pattern recognition.
Graphiti
Graphiti is a framework for building and querying temporally-aware, real-time knowledge graphs tailored for dynamic AI agents. It continuously integrates user interactions, structured enterprise data, and external information, enabling state-based reasoning, task automation, and precise historical queries without complete graph recomputation. It also serves as the core memory layer for Zep’s AI agents.
AIAccess PHP Library
Unified PHP SDK to access multiple AI providers (OpenAI, Claude, Gemini, DeepSeek, Grok).
Jupyter AI
Jupyter AI is a generative AI extension for JupyterLab that integrates advanced AI capabilities directly into Jupyter notebooks. It provides a %%ai magic command for turning notebooks into a generative AI playground, a native chat UI, and supports a wide range of AI model providers, including local support via GPT4All and Ollama.
vercel/ai
A developer-focused AI toolkit released by Vercel that includes libraries and tools for integrating AI capabilities into applications.
AI SDK Provider for Gemini CLI
An unofficial community provider for the Vercel AI SDK that enables the integration of Google’s Gemini models via the Gemini CLI Core library and Google Cloud Code endpoints. It offers streaming support, multimodal capabilities, tool/function calling, OAuth authentication, and full TypeScript support with Zod schemas.
chat-ui by huggingchat
A live chat interface hosted as a Hugging Face Space by the huggingchat team, allowing users to interact with conversational AI models through an intuitive web-based application.
MCP Toolbox for Databases
MCP Toolbox for Databases is an open source MCP server that simplifies building Gen AI tools for database access. It handles complexities such as connection pooling, authentication, and observability, enabling seamless integration of database functionalities into AI workflows.
AWS Generative AI Toolkit
A toolkit from AWS Labs for generative AI featuring multi-modal responses, tool registries, and integrated testing methods.
NVIDIA NeMo
Framework for training, customizing, and deploying LLMs, speech, and multimodal models, including support for video/world models and optimized pipelines.
genai-toolbox
An open‐source generative AI toolbox by googleapis that aggregates multiple tools and modules for integrating generative AI features (such as authentication, data source connectors for BigQuery, MySQL, ClickHouse, Looker, etc.) into applications. It is designed for developers to build and deploy AI-powered workflows and bridges between various data sources and AI models.
MediaPipe
MediaPipe is an open-source framework by Google AI Edge designed for building cross-platform multimodal machine learning pipelines, especially for computer vision and media processing tasks. It provides ready-to-use components and tools for rapid prototyping and deployment in AI applications.
Google AI Edge Gallery
Android app and toolkit to run generative AI models fully on‑device using MediaPipe/LiteRT
Node CharacterAI
A Node.js client library that acts as an unofficial wrapper for the Character AI API. It enables developers to create and manage chats with Character AI using guest or token authentication, and supports asynchronous messaging and conversation management.
AI-Tools
A Node.js package that provides a set of API client tools to access beta AI functionalities such as NSFW image detection and face detection. It wraps requests to a custom API server, enabling developers to integrate these AI capabilities into their applications.
Google Gen AI Python SDK
Official Python SDK for Google’s Generative AI APIs (Gemini), supporting chat, tools, function calling, images, and video.
GitHub MCP Server
GitHub’s official Model Context Protocol (MCP) server that lets AI agents and chat assistants securely access GitHub context and actions via natural language. It enables repository browsing and code search, issue/PR creation and management, Actions/workflow insights, security findings and Dependabot alerts review, and team collaboration data. Supported as a remote, GitHub‑hosted MCP server (OAuth or PAT) with quick setup in VS Code 1.101+ and other MCP hosts; a local server option is also available.
Spring AI
Spring AI is the official Spring Framework library for building AI-powered Java/Spring applications. It provides abstractions and starters for LLM chat/completions, embeddings, tool calling (MCP), memory/vector stores, and integrations with providers like OpenAI, Anthropic, ElevenLabs, MongoDB, and PgVector. The 1.1.0 GA release adds dynamic request field deserialization and includes multiple bug fixes, documentation improvements, and an MCP upgrade.