Show HN: Semble – Code search for agents that uses 98% fewer tokens than grep

TL;DR

Semble is a code search tool designed for agents that uses approximately 98% fewer tokens than traditional grep-based methods. It offers rapid, accurate code retrieval on CPU without external services, significantly improving developer workflows.

Semble, a new code search library tailored for AI agents, claims to reduce token usage by approximately 98% compared to traditional grep+read methods, enabling faster and more efficient code retrieval without external dependencies.

Developed to serve agents that require quick access to code snippets, Semble indexes repositories in under a second and answers queries in about 1.5 milliseconds, all on CPU. It achieves comparable retrieval quality to specialized transformer models, with a token reduction of around 98%, which significantly decreases computational costs and latency.

Semble can be integrated as an MCP server or invoked directly via command-line tools. It supports local repositories or remote git URLs, automatically re-indexing files on change. The library requires no API keys, GPU, or external services, making it accessible and easy to deploy.

According to the developer, benchmarks show Semble’s indexing is roughly 200 times faster, and query response is about 10 times quicker than code-specialized transformers, while maintaining 99% of their retrieval accuracy. It is compatible with various agents, including Claude Code, Codex, Cursor, and OpenCode, through straightforward setup instructions.

Why It Matters

This development matters because it offers a highly efficient, cost-effective method for AI agents and developers to search large codebases rapidly and accurately. By drastically reducing token consumption and eliminating reliance on external APIs or hardware accelerators, Semble can enhance productivity and scalability in code-centric AI workflows.

For organizations and individual developers working with large repositories or multiple agents, Semble could reduce operational costs and improve response times, making AI-assisted coding more practical and accessible.

Amazon

code search tools for developers

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Background

Traditional code search methods like grep are limited by token size and speed, especially when integrated into AI workflows. Existing transformer-based models provide accurate retrieval but are resource-intensive, often requiring GPUs and external APIs. Semble emerges as a lightweight alternative, emphasizing speed and token efficiency, and is part of a broader trend toward local, scalable AI tooling.

Its announcement follows ongoing efforts to optimize AI agent integrations with codebases, addressing bottlenecks in speed and cost. Prior tools have relied on external services or large models, but Semble’s local CPU-based approach aims to democratize high-performance code search.

“Semble indexes repositories in under a second and answers queries in about 1.5 milliseconds, all on CPU, with 99% retrieval quality.”

— Semble Developer

“It reduces token usage by approximately 98% compared to grep+read, significantly cutting costs and latency.”

— Semble Developer

Amazon

AI code search library

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

What Remains Unclear

It is not yet clear how Semble performs across diverse codebases or in comparison to the latest transformer models in real-world scenarios. Long-term stability, scalability, and integration challenges remain to be tested in varied environments.

Amazon

local code search software

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

What’s Next

Next steps include broader adoption and testing of Semble across different projects, further benchmarking against other code search tools, and potential feature enhancements such as support for additional agents or more complex queries.

Developers and organizations may also explore integrating Semble into their CI/CD pipelines or AI workflows to evaluate its impact on productivity and costs.

Amazon

developer productivity tools

As an affiliate, we earn on qualifying purchases.

As an affiliate, we earn on qualifying purchases.

Key Questions

How does Semble compare to traditional grep?

Semble uses approximately 98% fewer tokens than grep+read, providing faster and more efficient code searches tailored for AI agents, with comparable accuracy.

Does Semble require external services or GPUs?

No, Semble runs entirely on CPU and does not require API keys, GPUs, or external dependencies, making it easy to deploy locally.

Can Semble handle remote repositories?

Yes, Semble supports both local paths and remote git URLs, automatically cloning and indexing repositories as needed.

What agents or tools can integrate with Semble?

Semble integrates with agents like Claude Code, Codex, Cursor, and OpenCode via MCP or CLI, enabling seamless code search within existing workflows.

What are the future plans for Semble?

Future developments may include broader adoption, more benchmarking, and additional features to support complex queries and larger codebases.

You May Also Like

60% of PC gamers have no plans to build a new PC in the next two years — AI pricing crunch on RAM and other components paralyze enthusiast market

60% of PC gamers surveyed plan to wait over two years before building a new PC, citing high component prices driven by AI data center demand.

Microsoft is retiring Teams’ Together Mode

Microsoft is gradually removing Teams’ Together Mode to streamline user experience and improve video quality, ending a pandemic-era feature.

Agent VCR – Time-travel debugging for LLM agents (rewind, edit state, resume)

Agent VCR introduces local, rewindable, and editable debugging for LLM agents, enabling precise troubleshooting and session management without cloud reliance.

How to Protect Your Photos From Accidental Deletion

When it comes to protecting your photos from accidental deletion, discover essential tips to safeguard your memories effectively.