A Python Module Which Compacts LLM Prompts
claude mcp add compact-prompt -- python -m compactprompt{
"mcpServers": {
"compact-prompt": {
"command": "python",
"args": ["-m", "compactprompt"]
}
}
}MCP Servers overview
# CompactPrompt
[](https://github.com/gtkcyber/compact_prompt/actions/workflows/tests.yml)
[](https://github.com/gtkcyber/compact_prompt/actions/workflows/pylint.yml)
[](https://compact-prompt.readthedocs.io/en/latest/)

[](https://glama.ai/mcp/servers/gtkcyber/compact_prompt)
CompactPrompt shortens the text you send to an AI model while preserving its
meaning. The result costs less to run, returns faster, and is less likely to
exceed the model's input limit. The common case is a single function call, and
no background in machine learning is required to use it.
## Background
An AI model reads an input — the *prompt* — and returns a response. Providers
charge according to the amount of text processed, measured in *tokens* (each
token is roughly three-quarters of a word), and every model has a maximum input
size. A long prompt that combines instructions, documents, tables, and examples
therefore costs more, responds more slowly, and may not fit at all.
CompactPrompt reduces the size of a prompt while retaining the information that
matters, so you keep the substance and discard the overhead.
## Getting started
Install the library:
```bash
pip install compactprompt
```
Shorten a prompt:
```python
from compactprompt import CompactPrompt
result = CompactPrompt.compact(
"Please could you very kindly go ahead and provide a really concise "
"summary of the quarterly report."
)
print(result.text)
print(f"{result.ratio:.1f}x smaller "
f"({result.tokens_before} -> {result.tokens_after} tokens)")
```
Output:
```
a really concise summary of the quarterly report.
1.7x smaller (22 -> 13 tokens)
```
The filler — *"Please could you very kindly go ahead and"* — is removed, and the
meaning is unchanged.
## What it does
CompactPrompt provides several methods for reducing the size of a prompt. They
can be used individually or together. Each is described below in plain terms,
followed by an optional technical note.
### Trimming low-value wording
Removes words that carry little meaning, such as conversational filler, and
keeps the words that do. This is lossy: the removed words are not recoverable,
but the result is ready to use as it is.
```python
from compactprompt import CompactPrompt
# Remove approximately 40% of the tokens
result = CompactPrompt.compact(prompt, ratio=0.4)
# Or target a specific size
result = CompactPrompt.compact(prompt, budget=64)
```
<details>
<summary>Technical detail</summary>
Each word receives an information score that combines how rare it is in general
(static self-information) with how predictable it is in context (dynamic
self-information from a small language model). Low-scoring words are removed.
Whole grammatical phrases are removed together, using spaCy, so the result
remains readable, and names and numbers are protected. This implements the
fusion rule from the *CompactPrompt* paper.
</details>
### Reversible shortening of repeated phrases
When a phrase recurs, it is replaced with a short placeholder, and a key records
what each placeholder stands for. This is lossless: the exact original can be
restored at any time.
```python
import compactprompt as cp
doc = "operating cash flow rose. operating cash flow fell. operating cash flow held."
abbr = cp.abbreviate(doc, n=3)
print(abbr.text) # '@0 rose. @0 fell. @0 held.'
print(abbr.dictionary) # {'@0': 'operating cash flow'}
print(abbr.restore()) # the exact original
```
Retain `abbr.dictionary` so the placeholders can be expanded again later.
### Reducing the size of numeric data
Large tables of numbers consume many tokens. This lowers their precision to save
space while guaranteeing that the rounding never exceeds a known bound.
```python
import compactprompt as cp
q = cp.quantize([1.0, 2.5, 3.3, 4.8, 9.2, 10.0], bits=8)
q.reconstruct() # the rounded values
q.max_error # the guaranteed maximum error
```
### Selecting representative examples
Models perform better when shown a few examples. If you have many candidate
examples, this selects a small, varied subset that still reflects the full
range, so you send a representative few rather than all of them.
```python
from compactprompt import select_examples
chosen = select_examples(my_examples)
chosen.examples
```
## Choosing how the wording is trimmed
The wording-trimming step can be carried out by any of three interchangeable
engines. All of them shorten text; they differ in how they decide what to remove
and in what they require to run. Select one with the `engine` argument — nothing
else in your code changes.
| Engine | Approach | Requirements |
|--------|----------|--------------|
| **Built-in** (default) | Scores each word and removes the least useful. Runs locally. | None |
| **LLMLingua** | Microsoft's established tool, which uses a small model to decide what to remove. | Downloads a model |
| **Caveman** | Rewrites the text in a concise style, preserving code, links, and headings. | Access to a language model |
```python
CompactPrompt.compact(prompt) # built-in, no extra install
CompactPrompt.compact(prompt, engine="llmlingua") # pip install 'compactprompt[llmlingua]'
CompactPrompt.compact(prompt, engine="caveman") # pip install 'compactprompt[caveman]'
```
The built-in engine and the other core features implement the
[*CompactPrompt* research paper](https://arxiv.org/abs/2510.18043). LLMLingua and
Caveman are independent open-source tools that this library integrates; see
[Attribution](#attribution).
## Compacting files and skills
CompactPrompt can also compact whole markdown files — documentation,
`CLAUDE.md`, notes, and Claude Code **skills** (`SKILL.md`) — not just strings.
It can first **review** a file or folder to report where the savings are.
This works safely by design: YAML frontmatter is preserved exactly, fenced code
blocks and links are never altered, the result is rejected if it would change a
heading, code block, or URL, and nothing is written without `--apply` (which
first saves a `.bak` backup). Files that look like code, config, or secrets are
skipped automatically.
From the command line:
```bash
# See where the savings are (read-only)
compactprompt review ./skills
# Preview the compaction of one skill (writes nothing)
compactprompt compact ./skills/my-skill/SKILL.md --engine builtin
# Apply it (saves SKILL.md.bak, then rewrites the file)
compactprompt compact ./skills/my-skill/SKILL.md --engine caveman --apply
```
`--engine` is required — you choose `builtin`, `llmlingua`, or `caveman` each
time (caveman, which rewrites prose, is usually best for human-readable files).
From Python:
```python
from compactprompt import review_file, compact_file
report = review_file("SKILL.md")
print(report.tokens, report.issues)
result = compact_file("SKILL.md", engine="caveman", apply=True)
print(result.tokens_before, "->", result.tokens_after)
```
The Streamlit app's **Files & Skills** tab does the same interactively.
## Optional features
The basic installation requires no setup. Additional features depend on extra
components, which you install only as needed:
```bash
pip install compactprompt # core: trimming and reversible shortening
pip install 'compactprompt[ml]' # numeric reduction and example selection
pip install 'compactprompt[llmlingua]' # the LLMLingua engine
pip install 'compactprompt[caveman]' # the Caveman engine
pip install 'compactprompt[mcp]' # the MCP server for AI agents
pip install 'compactprompt[app]' # the interactive application
pip install 'compactprompt[all]' # everything
```
When a feature needs a component that is not installed, CompactPrompt reports
exactly what to install.
## Interactive application
A small web application lets you paste a prompt and see it shortened, with the
savings reported as you go:
```bash
pip install 'compactprompt[app]'
streamlit run compactprompt_app.py
```
It opens in the browser. Use the sidebar to choose an engine and set how much to
remove.
## Use it from an AI agent
CompactPrompt ships an **MCP server** so AI coding tools (Claude Code, Codex,
Cursor, Gemini, and any MCP-capable agent) can review and compact prompts, docs,
and skills directly:
```bash
pip install 'compactprompt[mcp]' # provides the `compactprompt-mcp` command
claude mcp add compactprompt -- compactprompt-mcp # e.g. for Claude Code
```
The [`agent-skills/`](agent-skills/) directory also has lightweight skill/rules
files and an `install.sh` for the same tools. See its
[README](agent-skills/README.md) for per-tool configuration.
<!-- mcp-name: io.github.gtkcyber/compactprompt -->
## Confirming the meaning is preserved
To check that a shortened prompt still means the same thing, you can measure the
similarity between the original and the result, where 1.0 indicates identical
meaning:
```python
from compactprompt import cosine_fidelity # pip install 'compactprompt[embeddings]'
score = cosine_fidelity(original_text, result.text)
print(score.mean)
```
## Reference
`CompactPrompt.compact(...)` returns a result object with the following fields:
| Field | Meaning |
|-------|---------|
| `.text` | The shortened prompt. |
| `.original` | The input. |
| `.tokens_before` / `.tokens_after` | Size before and after. |
| `.ratio` | How many times smaller (for example, `2.3`). |
| `.savings` | Fraction of tokens saved (for example, `0.4`). |
| `.dictionary` | The key for restoring shortened phrases, when used. |
| `.restore()` | Reverses the What people ask about compact_prompt
What is gtkcyber/compact_prompt?
+
gtkcyber/compact_prompt is mcp servers for the Claude AI ecosystem. A Python Module Which Compacts LLM Prompts It has 0 GitHub stars and was last updated 2d ago.
How do I install compact_prompt?
+
You can install compact_prompt by cloning the repository (https://github.com/gtkcyber/compact_prompt) or following the README instructions on GitHub. ClaudeWave also provides quick install blocks on this page.
Is gtkcyber/compact_prompt safe to use?
+
gtkcyber/compact_prompt has not been audited yet by our security agent. Review the original repository on GitHub before using it in production.
Who maintains gtkcyber/compact_prompt?
+
gtkcyber/compact_prompt is maintained by gtkcyber. The last recorded GitHub activity is from 2d ago, with 0 open issues.
Are there alternatives to compact_prompt?
+
Yes. On ClaudeWave you can browse similar mcp servers at /categories/mcp, sorted by popularity or recent activity.
Deploy compact_prompt to your cloud
Ship this repo to production in minutes. Each platform spins up its own environment with editable env vars.
Maintain this repo? Add a badge to your README
Drop the badge into your GitHub README to show it's tracked on ClaudeWave. Each badge links back to this page and reflects the live Trust Score.
[](https://claudewave.com/repo/gtkcyber-compact-prompt)<a href="https://claudewave.com/repo/gtkcyber-compact-prompt"><img src="https://claudewave.com/api/badge/gtkcyber-compact-prompt" alt="Featured on ClaudeWave: gtkcyber/compact_prompt" width="320" height="64" /></a>More MCP Servers
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
An open-source AI agent that brings the power of Gemini directly into your terminal.
The fastest path to AI-powered full stack observability, even for lean teams.
🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl!
⭐AI-driven public opinion & trend monitor with multi-platform aggregation, RSS, and smart alerts.🎯 告别信息过载,你的 AI 舆情监控助手与热点筛选工具!聚合多平台热点 + RSS 订阅,支持关键词精准筛选。AI 智能筛选新闻 + AI 翻译 + AI 分析简报直推手机,也支持接入 MCP 架构,赋能 AI 自然语言对话分析、情感洞察与趋势预测等。支持 Docker ,数据本地/云端自持。集成微信/飞书/钉钉/Telegram/邮件/ntfy/bark/slack 等渠道智能推送。