Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep

TL;DR

Semble is a code search tool designed for agents that uses approximately 98% fewer tokens than traditional grep-based methods. It offers rapid, accurate code retrieval on CPU without external services, significantly improving developer workflows.

Semble, a new code search library tailored for AI agents, claims to reduce token usage by approximately 98% compared to traditional grep+read methods, enabling faster and more efficient code retrieval without external dependencies.

Developed to serve agents that require quick access to code snippets, Semble indexes repositories in under a second and answers queries in about 1.5 milliseconds, all on CPU. It achieves comparable retrieval quality to specialized transformer models, with a token reduction of around 98%, which significantly decreases computational costs and latency.

Semble can be integrated as an MCP server or invoked directly via command-line tools. It supports local repositories or remote git URLs, automatically re-indexing files on change. The library requires no API keys, GPU, or external services, making it accessible and easy to deploy.

According to the developer, benchmarks show Semble’s indexing is roughly 200 times faster, and query response is about 10 times quicker than code-specialized transformers, while maintaining 99% of their retrieval accuracy. It is compatible with various agents, including Claude Code, Codex, Cursor, and OpenCode, through straightforward setup instructions.

Why It Matters

This development matters because it offers a highly efficient, cost-effective method for AI agents and developers to search large codebases rapidly and accurately. By drastically reducing token consumption and eliminating reliance on external APIs or hardware accelerators, Semble can enhance productivity and scalability in code-centric AI workflows.

For organizations and individual developers working with large repositories or multiple agents, Semble could reduce operational costs and improve response times, making AI-assisted coding more practical and accessible.

FOXWELL NT301 OBD2 Scanner Live Data Professional Mechanic OBDII Diagnostic Code Reader Tool for Check Engine Light

FOXWELL NT301 OBD2 Scanner Live Data Professional Mechanic OBDII Diagnostic Code Reader Tool for Check Engine Light

【Vehicle CEL Doctor】The NT301 obd2 scanner enables you to read DTCs, access to e-missions readiness status, turn off…

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Background

Traditional code search methods like grep are limited by token size and speed, especially when integrated into AI workflows. Existing transformer-based models provide accurate retrieval but are resource-intensive, often requiring GPUs and external APIs. Semble emerges as a lightweight alternative, emphasizing speed and token efficiency, and is part of a broader trend toward local, scalable AI tooling.

Its announcement follows ongoing efforts to optimize AI agent integrations with codebases, addressing bottlenecks in speed and cost. Prior tools have relied on external services or large models, but Semble’s local CPU-based approach aims to democratize high-performance code search.

“Semble indexes repositories in under a second and answers queries in about 1.5 milliseconds, all on CPU, with 99% retrieval quality.”

— Semble Developer

“It reduces token usage by approximately 98% compared to grep+read, significantly cutting costs and latency.”

— Semble Developer

Inateck 2D Barcode Scanner, Wireless Bluetooth QR Code Scanner with AI APP & SDK, 180-Day Battery Life, Fast & Accurate Scanning, Compatible with iOS/Android/Windows

Inateck 2D Barcode Scanner, Wireless Bluetooth QR Code Scanner with AI APP & SDK, 180-Day Battery Life, Fast & Accurate Scanning, Compatible with iOS/Android/Windows

Powerful Scanning Capability: The Inateck 2D barcode scanner accurately reads almost all 1D and 2D barcodes within a…

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

What Remains Unclear

It is not yet clear how Semble performs across diverse codebases or in comparison to the latest transformer models in real-world scenarios. Long-term stability, scalability, and integration challenges remain to be tested in varied environments.

ResumeMaker Professional Deluxe 20 - Software to Create Professional Resumes Includes Sample Resumes Written by Certified Resume Writers, Career Advice, Job Searches & Interview Questions - CD - PC

ResumeMaker Professional Deluxe 20 – Software to Create Professional Resumes Includes Sample Resumes Written by Certified Resume Writers, Career Advice, Job Searches & Interview Questions – CD – PC

Works on Windows 11, 10, & 8

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

What’s Next

Next steps include broader adoption and testing of Semble across different projects, further benchmarking against other code search tools, and potential feature enhancements such as support for additional agents or more complex queries.

Developers and organizations may also explore integrating Semble into their CI/CD pipelines or AI workflows to evaluate its impact on productivity and costs.

Beyond Vibe Coding: From Coder to AI-Era Developer

Beyond Vibe Coding: From Coder to AI-Era Developer

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

How does Semble compare to traditional grep?

Semble uses approximately 98% fewer tokens than grep+read, providing faster and more efficient code searches tailored for AI agents, with comparable accuracy.

Does Semble require external services or GPUs?

No, Semble runs entirely on CPU and does not require API keys, GPUs, or external dependencies, making it easy to deploy locally.

Can Semble handle remote repositories?

Yes, Semble supports both local paths and remote git URLs, automatically cloning and indexing repositories as needed.

What agents or tools can integrate with Semble?

Semble integrates with agents like Claude Code, Codex, Cursor, and OpenCode via MCP or CLI, enabling seamless code search within existing workflows.

What are the future plans for Semble?

Future developments may include broader adoption, more benchmarking, and additional features to support complex queries and larger codebases.

You May Also Like

Python JIT project was asked to pause development

Python Steering Council has asked for a pause on new JIT features in CPython until a formal PEP is approved, citing process and maintenance concerns.

The Simple Home Network Checklist That Makes Everything More Reliable

Making your home network more reliable starts with this essential checklist that reveals the key steps, and you’ll want to see what comes next.

Clawdmeter turns your Claude Code usage stats into a tiny desktop dashboard

Clawdmeter is an open-source device that visualizes Claude Code usage stats on a small desktop display, blending fun design with developer utility.

The Birthplace of AI

Researchers have verified the location where artificial intelligence was first developed, marking a significant milestone in AI history.