Product

Categories
Trending
Tags
Collections
New Repos
Comparisons

Resources

Submit a Repo
About
Newsletter
Privacy Policy
Terms
Contact

Popular Tags

#react
#typescript
#python
#machine-learning
#nextjs

sourcevana

Discover, Download, Deploy — Open Source Made Easy.

kreuzberg-dev/kreuzberg — framework efficiently extracts text, metadata, images, | Sourcevana

HomeAI Assistants & Chatbotskreuzberg

kreuzberg

by kreuzberg-dev

This framework efficiently extracts text, metadata, images, and structured data from over 97 document and image formats.

Verified Snapshot (380.0 MB)

Rust8.4K497Updated 16d agoFeatured

GitHub

Quick Overview

What is this?

Kreuzberg is a high-performance document intelligence framework built on Rust that processes PDFs, Office files, images, and many other formats. It offers broad integration options with bindings for languages like Rust, Python, Java, Node.js, and C#. You can interact with it via its command-line interface, like running `kreuzberg process document.pdf`, or by integrating language-specific bindings into your applications (e.g., `import kreuzberg` in Python or Node.js).

What problem does it solve?

This framework efficiently extracts text, metadata, images, and structured data from over 97 document and image formats.

Who should use it?

Developers needing to programmatically extract structured and unstructured data from diverse document formats should consider Kreuzberg.

Setup difficulty:Medium

Pros

Extracts diverse data types including text, metadata, images, and structured information.

Supports an extensive range of over 97 document and image formats.

Offers bindings and integration options for 10+ programming languages like Python, Java, and Node.js.

Cons

The Rust core might require specific build tools or FFI setup when integrating into non-Rust projects.
Deploying the MCP server or configuring the REST API adds operational complexity compared to a pure library dependency.

Scores

Trust Score

Star reputation (15%)79

Star velocity 7d (15%)0

Commit recency (15%)80

Fork ratio (10%)20

Issue ratio (10%)98

Contributor signal (10%)99

README quality (5%)100

License (5%)100

Homepage/demo (5%)100

Docs URL (5%)0

Topic count (5%)100

Maintenance

Commit frequency95

Issue management80

Documentation85

Popularity

Stars100

Forks50

Growth trend10

Star History

Snapshot Versions

Version	Commit	Size	Downloads	Date
latestLatest	HEAD	380.0 MB	0	1mo ago

Alternatives

hermes-agent

NousResearch

The agent that grows with you

Featured

Python177.0K30.2K16d ago

prompts.chat

f.k.a. Awesome ChatGPT Prompts. Share, discover, and collect prompts from the community. Free and open source — self-host for your organization with complete privacy.

Featured

HTML163.2K21.2K17d ago

dify

langgenius

Production-ready platform for agentic workflow development.

Featured

TypeScript143.5K22.6K16d ago

open-webui

User-friendly AI Interface (Supports Ollama, OpenAI API, ...)

Featured

Python139.7K20.0K17d ago

langchain

langchain-ai

The agent engineering platform

Featured

Python138.3K22.9K16d ago

awesome-llm-apps

Shubhamsaboo

Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.

Featured

Python112.6K16.7K18d ago

Community Reviews

Loading reviews...

README

Kreuzberg

Trust Score

Sourcevana Trust · 74/100

License

NOASSERTION

Languages

Rust57.2%

HTML22.1%

Elixir6.4%

Java2.1%

C#1.9%

Python1.6%

Go1.4%

C1.4%

Rich Text Format1.2%

PHP1.0%

Other3.6%

Topics

bun csharp document-intelligence elixir ffi golang java metadata-extraction node pdf-extraction pdfium php+8 more

Homepage

AddedMay 5, 2026

UpdatedJun 19, 2026

Last commit16d ago

Browse more in AI Assistants & Chatbots View all repos by kreuzberg-dev

Embed Trust Badge

README.md preview:

Sourcevana Trust · 74/100

[![Sourcevana Trust](https://sourcevana.com/api/badge/kreuzberg-dev-kreuzberg)](https://sourcevana.com/repo/kreuzberg-dev-kreuzberg)

Paste this into your README.md

Embed Widget

<iframe src="https://sourcevana.com/embed/kreuzberg-dev-kreuzberg" width="480" height="120" frameborder="0"></iframe>

Embed this repo card on any website or blog