Skip to content

Pinned Loading

  1. kreuzberg kreuzberg Public

    A polyglot document intelligence framework with a Rust core. Extract text, metadata, and structured information from PDFs, Office documents, images, and 88+ formats. Available for Rust, Python, Rub…

    Rust 7.1k 343

  2. html-to-markdown html-to-markdown Public

    High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts stru…

    HTML 598 50

  3. tree-sitter-language-pack tree-sitter-language-pack Public

    Comprehensive tree-sitter grammar compilation with polyglot bindings — Rust, Python, Node.js, Go, Java, Ruby, Elixir, PHP, C#, WASM, and CLI. 170+ languages.

    Rust 297 45

  4. langchain-kreuzberg langchain-kreuzberg Public

    Langchain document loader for Kreuzberg

    Python 4

  5. kreuzberg-surrealdb kreuzberg-surrealdb Public

    Extract, chunk, and embed documents from 88+ formats directly into SurrealDB.

    Python 6

Repositories

Showing 10 of 17 repositories
  • haystack Public Forked from deepset-ai/haystack

    Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and agent workflows with explicit control over retrieval, routing, memory, and generation. Built for scalable agents, RAG, multimodal applications, semantic search, and conversational systems.

    kreuzberg-dev/haystack’s past year of commit activity
    MDX 0 Apache-2.0 2,714 0 0 Updated Mar 26, 2026
  • tree-sitter-language-pack Public

    Comprehensive tree-sitter grammar compilation with polyglot bindings — Rust, Python, Node.js, Go, Java, Ruby, Elixir, PHP, C#, WASM, and CLI. 170+ languages.

    kreuzberg-dev/tree-sitter-language-pack’s past year of commit activity
    Rust 297 MIT 45 1 0 Updated Mar 26, 2026
  • kreuzberg Public

    A polyglot document intelligence framework with a Rust core. Extract text, metadata, and structured information from PDFs, Office documents, images, and 88+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno)- or use via CLI, REST API, or MCP server.

    kreuzberg-dev/kreuzberg’s past year of commit activity
    Rust 7,116 MIT 343 24 (1 issue needs help) 3 Updated Mar 26, 2026
  • homebrew-tap Public
    kreuzberg-dev/homebrew-tap’s past year of commit activity
    Ruby 0 0 1 0 Updated Mar 26, 2026
  • html-to-markdown Public

    High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts structured data from 56+ document formats using streaming parsers and built-in OCR.

    kreuzberg-dev/html-to-markdown’s past year of commit activity
    HTML 598 MIT 50 0 4 Updated Mar 26, 2026
  • haystack-integrations Public Forked from deepset-ai/haystack-integrations

    🚀 A list of Haystack Integrations, maintained by the community or deepset.

    kreuzberg-dev/haystack-integrations’s past year of commit activity
    0 129 0 0 Updated Mar 25, 2026
  • haystack-core-integrations Public Forked from deepset-ai/haystack-core-integrations

    Additional packages (components, document stores and the likes) to extend the capabilities of Haystack

    kreuzberg-dev/haystack-core-integrations’s past year of commit activity
    Python 0 Apache-2.0 222 0 0 Updated Mar 25, 2026
  • ai-rulez Public
    kreuzberg-dev/ai-rulez’s past year of commit activity
    2 MIT 0 0 0 Updated Mar 25, 2026
  • actions Public

    Shared GitHub actions

    kreuzberg-dev/actions’s past year of commit activity
    Shell 0 0 0 0 Updated Mar 22, 2026
  • kreuzberg-dev/kreuzberg-crewai’s past year of commit activity
    Python 0 MIT 0 0 0 Updated Mar 21, 2026

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Most used topics

Loading…