Update README.md

2026-06-27 17:01:18 +08:00 · 2026-05-15 13:43:57 +08:00
parent 3bf7b51206
commit 3d90059274
1 changed files with 24 additions and 21 deletions
@@ -1,34 +1,37 @@
-# minicache — tiny disk cache for ML / research code.
+## minicache — tiny disk cache for ML / research code.

-Wraps function calls and stores returns on disk (gzip + cloudpickle). Solves
+This wraps function calls and stores returns on disk (gzip + cloudpickle). Solves
 the four pain points that stdlib `functools.lru_cache + pickle` and existing
 function-cache libraries (anycache, cachier) hit on ML code:

- **Loaded models can't be hashed** → arg blacklist (`exclude=["model", "tok"]`).
-  Excluded args pass through to the function but never enter the cache key.
- **Tensors / pandas / closures break stdlib pickle** → cloudpickle backend.
- **Pickle files grow large** → gzip on disk (~3× smaller, free).
- **"Function source changed → invalidate" causes false invalidations on
-  reformat** → caller bumps an explicit `state` string when behavior actually
-  changes. No AST hashing magic.
+- *Loaded models can't be hashed*. So we use a arg blacklist (`exclude=["model", "tok"]`).
+  Here, excluded args pass through to the function but never enter the cache key.
+- *Tensors / pandas / closures can't be picked** → we use cloudpickle which extends to many more objects.
+- *Pickle files grow large* → gzip on disk save 20-50%

 ## Quick use

-```py
-  from minicache import cached, cache_call
+Install

-  # 1. Decorator: hashes (state, included args). Excludes drop out of key.
-  @cached("eval", cachedir="out/cache",
+```sh
+uv add git+https://github.com/wassname/minicache.git
+```
+
+```py
+from minicache import cached, cache_call
+
+# 1. Decorator: hashes (state, included args). Excludes drop out of key.
+@cached("eval", cachedir="out/cache",
        state_fn=lambda *, model_id, **_: f"{model_id}|nf4|r00+r02",
        exclude=["model", "tok"])
-  def run_eval(model, tok, *, model_id, name, batch_size):
+def run_eval(model, tok, *, model_id, name, batch_size):
    return tinymfv_evaluate(model, tok, name=name, batch_size=batch_size)

-  report = run_eval(model, tok, model_id="qwen-27b", name="classic", batch_size=16)
+report = run_eval(model, tok, model_id="qwen-27b", name="classic", batch_size=16)

-  # 2. Explicit key: no introspection, you compose the key
-  key = "qwen-27b|nf4|r00+r02|eval|classic|bs=16"
-  report = cache_call("eval", key, lambda: tinymfv_evaluate(model, tok, ...),
+# 2. Explicit key: no introspection, you compose the key
+key = "qwen-27b|nf4|r00+r02|eval|classic|bs=16"
+report = cache_call("eval", key, lambda: tinymfv_evaluate(model, tok, ...),
                    cachedir="out/cache")
 ```