Label, clean and enrich text datasets with LLMs.
Autolabel is a Python library that automates the labeling, cleaning, and enrichment of text datasets by routing them through large language models via a JSON configuration file. Users define a task type (classification, named entity recognition, question answering, and others), specify labeling guidelines and few-shot examples, select a model provider, then call a LabelingAgent to run the labeling pipeline against a CSV dataset. It connects to Claude through Anthropic's API, supporting models such as claude-3-opus-20240229 alongside OpenAI, Google, and HuggingFace-hosted models. A built-in plan step previews the final prompt and estimates cost before any labels are generated, a practical safeguard for large datasets. The library also includes a benchmarking suite that runs identical prompts across all supported models and outputs a results.csv for direct comparison. The primary audience is machine learning engineers and data scientists who need annotated training data at lower cost and faster turnaround than manual labeling workflows.
- ✓Open-source license (MIT)
- ✓Healthy fork ratio
- ✓Clear description
- ✓Topics declared
- ✓Mature repo (>1y old)
- ✓Documented (README)
- !Stale (last commit >463d ago)
git clone https://github.com/refuel-ai/autolabelResumen de Tools
Lo que la gente pregunta sobre autolabel
¿Qué es refuel-ai/autolabel?
+
refuel-ai/autolabel es tools para el ecosistema de Claude AI. Label, clean and enrich text datasets with LLMs. Tiene 2.3k estrellas en GitHub y se actualizó por última vez 1y ago.
¿Cómo se instala autolabel?
+
Puedes instalar autolabel clonando el repositorio (https://github.com/refuel-ai/autolabel) o siguiendo las instrucciones del README en GitHub. ClaudeWave también te ofrece bloques de instalación rápida en esta misma página.
¿Es seguro usar refuel-ai/autolabel?
+
Nuestro agente de seguridad ha analizado refuel-ai/autolabel y le ha asignado un Trust Score de 80/100 (tier: Trusted). Revisa el desglose completo de comprobaciones superadas y flags en esta página.
¿Quién mantiene refuel-ai/autolabel?
+
refuel-ai/autolabel es mantenido por refuel-ai. La última actividad registrada en GitHub es de 1y ago, con 81 issues abiertos.
¿Hay alternativas a autolabel?
+
Sí. En ClaudeWave puedes explorar tools similares en /categories/tools, ordenados por popularidad o actividad reciente.
Despliega autolabel en tu cloud
Lleva este repo a producción en minutos. Cada plataforma genera su propio entorno con variables de entorno editables.
¿Mantienes este repo? Añade un badge a tu README
Pega el badge en tu README de GitHub para mostrar que está auditado por ClaudeWave. Cada badge enlaza de vuelta a esta página y muestra el Trust Score actual.
[](https://claudewave.com/repo/refuel-ai-autolabel)<a href="https://claudewave.com/repo/refuel-ai-autolabel"><img src="https://claudewave.com/api/badge/refuel-ai-autolabel" alt="Featured on ClaudeWave: refuel-ai/autolabel" width="320" height="64" /></a>Más Tools
A single CLAUDE.md file to improve Claude Code behavior, derived from Andrej Karpathy's observations on LLM coding pitfalls.
An AI SKILL that provide design intelligence for building professional UI/UX multiple platforms
🪨 why use many token when few token do trick — Claude Code skill that cuts 65% of tokens by talking like caveman
AI coding assistant skill (Claude Code, Codex, OpenCode, Cursor, Gemini CLI, and more). Turn any folder of code, SQL schemas, R scripts, shell scripts, docs, papers, images, or videos into a queryable knowledge graph. App code + database schema + infrastructure in one graph.
A light-weight and powerful meta-prompting, context engineering and spec-driven development system for Claude Code by TÂCHES.
CLI proxy that reduces LLM token consumption by 60-90% on common dev commands. Single Rust binary, zero dependencies