[ANN] SemanticCaches.jl: Save Time and Money with Request Caching

Are you tired of repeated LLM API calls slowing down your AI application and eating into your budget? Look no further than SemanticCaches.jl, a caching package designed specifically for LLM workloads.

By caching expensive and slow LLM API calls (which can take up to 20 seconds each!), this package helps reduce the time and money spent on repeated requests. With its semantic similarity-based caching, SemanticCaches.jl ensures that you get the most out of your resources.

How it Works

SemanticCaches.jl offers two types of caching: exact matching (HashCache) and semantic similarity lookup (SemanticCache). The package uses a tiny BERT model to provide fast local embeddings on a CPU, making it suitable for applications with smaller volumes of requests. For shorter inputs, semantic similarity lookup improves your cache hit rate by tolerating typos, small reformulations, and similar variations.
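To make the semantic lookup idea concrete, here is a minimal, self-contained sketch of how similarity-based caching works in principle. This is NOT the package's actual implementation or API: the toy character-trigram `embed` function merely stands in for the tiny BERT model, and the names `ToyCache`, `lookup`, and `store!` are invented for illustration.

```julia
# Toy embedding: character-trigram counts. The real package uses a small BERT
# model for embeddings; this stand-in just makes the sketch self-contained.
function embed(text::AbstractString)
    chars = collect(lowercase(text))   # collect to index Unicode safely
    counts = Dict{String,Int}()
    for i in 1:max(length(chars) - 2, 0)
        tri = String(chars[i:i+2])
        counts[tri] = get(counts, tri, 0) + 1
    end
    return counts
end

# Cosine similarity between two sparse count vectors.
function cosine(a::Dict{String,Int}, b::Dict{String,Int})
    dot = sum(v * get(b, k, 0) for (k, v) in a; init = 0)
    na = sqrt(sum(v^2 for v in values(a); init = 0))
    nb = sqrt(sum(v^2 for v in values(b); init = 0))
    (na == 0 || nb == 0) && return 0.0
    return dot / (na * nb)
end

# A cache entry pairs an input embedding with the cached answer.
struct ToyCache
    items::Vector{Tuple{Dict{String,Int},String}}
end
ToyCache() = ToyCache(Tuple{Dict{String,Int},String}[])

# Return the cached answer for any stored input similar enough to `query`,
# or `nothing` on a cache miss.
function lookup(cache::ToyCache, query; threshold = 0.95)
    emb = embed(query)
    for (e, answer) in cache.items
        cosine(e, emb) >= threshold && return answer
    end
    return nothing
end

store!(cache::ToyCache, query, answer) = push!(cache.items, (embed(query), answer))

# Usage: near-identical inputs hit the cache instead of triggering a new call.
cache = ToyCache()
store!(cache, "What is the capital of France?", "Paris")
lookup(cache, "what is the capital of france?")   # → "Paris" (hit despite casing)
```

The key design point is the threshold: a stricter threshold avoids serving a stale answer for a genuinely different question, while a looser one catches more rephrasings. The package makes this trade-off with proper BERT embeddings rather than the toy trigrams above.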

Key Features

  • Reduces costs by caching expensive API calls
  • Ideal for demos, small user applications, and evals with repeated requests
  • Suitable for applications with smaller volumes of requests (<10k per session or machine)
  • Supports caching HTTP requests with PromptingTools.jl

Get Started

To install SemanticCaches.jl, simply add the package using the Julia package manager:

using Pkg
Pkg.add("SemanticCaches")

Check out the Quick Start Guide to get started with SemanticCaches.jl today!
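To show the overall flow a cache like this enables, here is a hypothetical sketch of exact-match (HashCache-style) caching around a slow call. This is an illustration only, not the package's API (the package's actual interface is documented in the Quick Start Guide); `expensive_llm_call` and `cached_call` are invented names, and the Dict keyed by `hash` stands in for the real HashCache.

```julia
# Hypothetical exact-match cache: a Dict keyed by the hash of the prompt.
const CACHE = Dict{UInt64,String}()

# Stand-in for a slow LLM API call.
function expensive_llm_call(prompt::String)
    sleep(0.1)                                # simulate network latency
    return "Response to: $prompt"
end

function cached_call(prompt::String)
    key = hash(prompt)
    haskey(CACHE, key) && return CACHE[key]   # exact-match cache hit: instant
    result = expensive_llm_call(prompt)       # cache miss: pay the full cost once
    CACHE[key] = result
    return result
end

cached_call("Summarize this document.")   # slow: first call misses the cache
cached_call("Summarize this document.")   # fast: served from the cache
```

Exact matching like this only helps when requests repeat verbatim; the semantic lookup described above extends the same flow to near-identical inputs.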

Try it Today
Try SemanticCaches.jl today and start saving time and money on your GenAI application development!