Skip to main content
ClaudeWave

A Python Module Which Compacts LLM Prompts

MCP ServersOfficial Registry0 stars0 forksPythonMITUpdated 2d ago
Install in Claude Code / Claude Desktop
Method: pip / Python · compactprompt
Claude Code CLI
claude mcp add compact-prompt -- python -m compactprompt
claude_desktop_config.json (Claude Desktop)
{
  "mcpServers": {
    "compact-prompt": {
      "command": "python",
      "args": ["-m", "compactprompt"]
    }
  }
}
1. Run the command above in your terminal (Claude Code), or paste the JSON config into claude_desktop_config.json (Claude Desktop).
2. Replace any <placeholder> values with your API keys or paths.
3. Restart Claude. The MCP server and its tools appear automatically.
💡 Install first: pip install compactprompt
Use cases

MCP Servers overview

# CompactPrompt

[![Tests](https://github.com/gtkcyber/compact_prompt/actions/workflows/tests.yml/badge.svg)](https://github.com/gtkcyber/compact_prompt/actions/workflows/tests.yml)
[![Pylint](https://github.com/gtkcyber/compact_prompt/actions/workflows/pylint.yml/badge.svg)](https://github.com/gtkcyber/compact_prompt/actions/workflows/pylint.yml)
[![Documentation Status](https://readthedocs.org/projects/compact-prompt/badge/?version=latest)](https://compact-prompt.readthedocs.io/en/latest/)
![PyPI - Version](https://img.shields.io/pypi/v/compactprompt)
[![gtkcyber/compact_prompt MCP server](https://glama.ai/mcp/servers/gtkcyber/compact_prompt/badges/score.svg)](https://glama.ai/mcp/servers/gtkcyber/compact_prompt)


CompactPrompt shortens the text you send to an AI model while preserving its
meaning. The result costs less to run, returns faster, and is less likely to
exceed the model's input limit. The common case is a single function call, and
no background in machine learning is required to use it.

## Background

An AI model reads an input — the *prompt* — and returns a response. Providers
charge according to the amount of text processed, measured in *tokens* (each
token is roughly three-quarters of a word), and every model has a maximum input
size. A long prompt that combines instructions, documents, tables, and examples
therefore costs more, responds more slowly, and may not fit at all.

CompactPrompt reduces the size of a prompt while retaining the information that
matters, so you keep the substance and discard the overhead.

## Getting started

Install the library:

```bash
pip install compactprompt
```

Shorten a prompt:

```python
from compactprompt import CompactPrompt

result = CompactPrompt.compact(
    "Please could you very kindly go ahead and provide a really concise "
    "summary of the quarterly report."
)

print(result.text)
print(f"{result.ratio:.1f}x smaller "
      f"({result.tokens_before} -> {result.tokens_after} tokens)")
```

Output:

```
a really concise summary of the quarterly report.
1.7x smaller (22 -> 13 tokens)
```

The filler — *"Please could you very kindly go ahead and"* — is removed, and the
meaning is unchanged.

## What it does

CompactPrompt provides several methods for reducing the size of a prompt. They
can be used individually or together. Each is described below in plain terms,
followed by an optional technical note.

### Trimming low-value wording

Removes words that carry little meaning, such as conversational filler, and
keeps the words that do. This is lossy: the removed words are not recoverable,
but the result is ready to use as it is.

```python
from compactprompt import CompactPrompt

# Remove approximately 40% of the tokens
result = CompactPrompt.compact(prompt, ratio=0.4)

# Or target a specific size
result = CompactPrompt.compact(prompt, budget=64)
```

<details>
<summary>Technical detail</summary>

Each word receives an information score that combines how rare it is in general
(static self-information) with how predictable it is in context (dynamic
self-information from a small language model). Low-scoring words are removed.
Whole grammatical phrases are removed together, using spaCy, so the result
remains readable, and names and numbers are protected. This implements the
fusion rule from the *CompactPrompt* paper.
</details>

### Reversible shortening of repeated phrases

When a phrase recurs, it is replaced with a short placeholder, and a key records
what each placeholder stands for. This is lossless: the exact original can be
restored at any time.

```python
import compactprompt as cp

doc = "operating cash flow rose. operating cash flow fell. operating cash flow held."
abbr = cp.abbreviate(doc, n=3)

print(abbr.text)        # '@0 rose. @0 fell. @0 held.'
print(abbr.dictionary)  # {'@0': 'operating cash flow'}
print(abbr.restore())   # the exact original
```

Retain `abbr.dictionary` so the placeholders can be expanded again later.

### Reducing the size of numeric data

Large tables of numbers consume many tokens. This lowers their precision to save
space while guaranteeing that the rounding never exceeds a known bound.

```python
import compactprompt as cp

q = cp.quantize([1.0, 2.5, 3.3, 4.8, 9.2, 10.0], bits=8)
q.reconstruct()   # the rounded values
q.max_error       # the guaranteed maximum error
```

### Selecting representative examples

Models perform better when shown a few examples. If you have many candidate
examples, this selects a small, varied subset that still reflects the full
range, so you send a representative few rather than all of them.

```python
from compactprompt import select_examples

chosen = select_examples(my_examples)
chosen.examples
```

## Choosing how the wording is trimmed

The wording-trimming step can be carried out by any of three interchangeable
engines. All of them shorten text; they differ in how they decide what to remove
and in what they require to run. Select one with the `engine` argument — nothing
else in your code changes.

| Engine | Approach | Requirements |
|--------|----------|--------------|
| **Built-in** (default) | Scores each word and removes the least useful. Runs locally. | None |
| **LLMLingua** | Microsoft's established tool, which uses a small model to decide what to remove. | Downloads a model |
| **Caveman** | Rewrites the text in a concise style, preserving code, links, and headings. | Access to a language model |

```python
CompactPrompt.compact(prompt)                       # built-in, no extra install
CompactPrompt.compact(prompt, engine="llmlingua")   # pip install 'compactprompt[llmlingua]'
CompactPrompt.compact(prompt, engine="caveman")     # pip install 'compactprompt[caveman]'
```

The built-in engine and the other core features implement the
[*CompactPrompt* research paper](https://arxiv.org/abs/2510.18043). LLMLingua and
Caveman are independent open-source tools that this library integrates; see
[Attribution](#attribution).

## Compacting files and skills

CompactPrompt can also compact whole markdown files — documentation,
`CLAUDE.md`, notes, and Claude Code **skills** (`SKILL.md`) — not just strings.
It can first **review** a file or folder to report where the savings are.

This works safely by design: YAML frontmatter is preserved exactly, fenced code
blocks and links are never altered, the result is rejected if it would change a
heading, code block, or URL, and nothing is written without `--apply` (which
first saves a `.bak` backup). Files that look like code, config, or secrets are
skipped automatically.

From the command line:

```bash
# See where the savings are (read-only)
compactprompt review ./skills

# Preview the compaction of one skill (writes nothing)
compactprompt compact ./skills/my-skill/SKILL.md --engine builtin

# Apply it (saves SKILL.md.bak, then rewrites the file)
compactprompt compact ./skills/my-skill/SKILL.md --engine caveman --apply
```

`--engine` is required — you choose `builtin`, `llmlingua`, or `caveman` each
time (caveman, which rewrites prose, is usually best for human-readable files).

From Python:

```python
from compactprompt import review_file, compact_file

report = review_file("SKILL.md")
print(report.tokens, report.issues)

result = compact_file("SKILL.md", engine="caveman", apply=True)
print(result.tokens_before, "->", result.tokens_after)
```

The Streamlit app's **Files & Skills** tab does the same interactively.

## Optional features

The basic installation requires no setup. Additional features depend on extra
components, which you install only as needed:

```bash
pip install compactprompt                  # core: trimming and reversible shortening
pip install 'compactprompt[ml]'            # numeric reduction and example selection
pip install 'compactprompt[llmlingua]'     # the LLMLingua engine
pip install 'compactprompt[caveman]'       # the Caveman engine
pip install 'compactprompt[mcp]'           # the MCP server for AI agents
pip install 'compactprompt[app]'           # the interactive application
pip install 'compactprompt[all]'           # everything
```

When a feature needs a component that is not installed, CompactPrompt reports
exactly what to install.

## Interactive application

A small web application lets you paste a prompt and see it shortened, with the
savings reported as you go:

```bash
pip install 'compactprompt[app]'
streamlit run compactprompt_app.py
```

It opens in the browser. Use the sidebar to choose an engine and set how much to
remove.

## Use it from an AI agent

CompactPrompt ships an **MCP server** so AI coding tools (Claude Code, Codex,
Cursor, Gemini, and any MCP-capable agent) can review and compact prompts, docs,
and skills directly:

```bash
pip install 'compactprompt[mcp]'      # provides the `compactprompt-mcp` command
claude mcp add compactprompt -- compactprompt-mcp   # e.g. for Claude Code
```

The [`agent-skills/`](agent-skills/) directory also has lightweight skill/rules
files and an `install.sh` for the same tools. See its
[README](agent-skills/README.md) for per-tool configuration.

<!-- mcp-name: io.github.gtkcyber/compactprompt -->

## Confirming the meaning is preserved

To check that a shortened prompt still means the same thing, you can measure the
similarity between the original and the result, where 1.0 indicates identical
meaning:

```python
from compactprompt import cosine_fidelity   # pip install 'compactprompt[embeddings]'

score = cosine_fidelity(original_text, result.text)
print(score.mean)
```

## Reference

`CompactPrompt.compact(...)` returns a result object with the following fields:

| Field | Meaning |
|-------|---------|
| `.text` | The shortened prompt. |
| `.original` | The input. |
| `.tokens_before` / `.tokens_after` | Size before and after. |
| `.ratio` | How many times smaller (for example, `2.3`). |
| `.savings` | Fraction of tokens saved (for example, `0.4`). |
| `.dictionary` | The key for restoring shortened phrases, when used. |
| `.restore()` | Reverses the 
aiai-toolsclaudellmllm-toolsmcpmcp-serverprompt-compressionprompt-engineeringpythontokens

What people ask about compact_prompt

What is gtkcyber/compact_prompt?

+

gtkcyber/compact_prompt is mcp servers for the Claude AI ecosystem. A Python Module Which Compacts LLM Prompts It has 0 GitHub stars and was last updated 2d ago.

How do I install compact_prompt?

+

You can install compact_prompt by cloning the repository (https://github.com/gtkcyber/compact_prompt) or following the README instructions on GitHub. ClaudeWave also provides quick install blocks on this page.

Is gtkcyber/compact_prompt safe to use?

+

gtkcyber/compact_prompt has not been audited yet by our security agent. Review the original repository on GitHub before using it in production.

Who maintains gtkcyber/compact_prompt?

+

gtkcyber/compact_prompt is maintained by gtkcyber. The last recorded GitHub activity is from 2d ago, with 0 open issues.

Are there alternatives to compact_prompt?

+

Yes. On ClaudeWave you can browse similar mcp servers at /categories/mcp, sorted by popularity or recent activity.

Deploy compact_prompt to your cloud

Ship this repo to production in minutes. Each platform spins up its own environment with editable env vars.

Maintain this repo? Add a badge to your README

Drop the badge into your GitHub README to show it's tracked on ClaudeWave. Each badge links back to this page and reflects the live Trust Score.

Featured on ClaudeWave: gtkcyber/compact_prompt
[![Featured on ClaudeWave](https://claudewave.com/api/badge/gtkcyber-compact-prompt)](https://claudewave.com/repo/gtkcyber-compact-prompt)
<a href="https://claudewave.com/repo/gtkcyber-compact-prompt"><img src="https://claudewave.com/api/badge/gtkcyber-compact-prompt" alt="Featured on ClaudeWave: gtkcyber/compact_prompt" width="320" height="64" /></a>

More MCP Servers

compact_prompt alternatives