Commit Graph

  • 2a50373311 test: put scripts/ on sys.path so benchmark's sibling _cost import resolves in CI main wassname 2026-06-19 08:47:41 +08:00
  • 28d04f1e1d gitignore: match loraxs_ review scratch; track curated loraxs_review.md wassname 2026-06-19 06:04:25 +08:00
  • 8005423c47 README: note LoRA-XS all-linear spread didn't help (test 55.6 vs down_proj 56.8) wassname 2026-06-18 23:49:36 +08:00
  • 5d910996b3 justfile: bench-variant takes a target_override arg, routed to its own out dir wassname 2026-06-18 21:53:30 +08:00
  • a75bed492b README: add LoRA-XS variant row (test 56.8 / valid 68.0, params 0.025M) wassname 2026-06-18 21:28:10 +08:00
  • 4e03f9c07f lora_xs: fix docstring -- A=diag(Sr)Vhr has row norms Sr, not orthonormal wassname 2026-06-18 20:01:59 +08:00
  • c792ad3e5f Add LoRA-XS variant: train only r×r core R between frozen SVD factors wassname 2026-06-18 19:48:40 +08:00
  • 12e13cca79 README: rot basis is within noise (seed order flips), soften V claim wassname 2026-06-18 03:32:01 +08:00
  • 00f8cd0872 Correct rot ablation: V>U>both was seed variance, not a real ordering antipasto-variants wassname 2026-06-18 03:31:42 +08:00
  • 12fa56f328 Collapse antipasto family to one variant: rot(V) becomes canonical antipasto wassname 2026-06-17 21:05:51 +08:00
  • 21cc9a84ee gitignore: external-review scratch (.pi, raw jsonl, err txt) + papers/md wassname 2026-06-17 20:29:30 +08:00
  • 09dcfe0d41 Revert ablate lora_c warm-start: variance-PC seed didn't help on SFT wassname 2026-06-17 20:18:41 +08:00
  • 458c3861e8 justfile: bench-variant takes a seed arg (default 0, unchanged) wassname 2026-06-17 18:29:23 +08:00
  • ef69c889a7 Merge antipasto-svd-cores: rotation-free S-space adapter family wassname 2026-06-17 18:28:06 +08:00
  • 12109b6fc0 README: order variant table by test accuracy wassname 2026-06-17 18:26:45 +08:00
  • fe562c2b5c antipasto_ablate: warm-start lora_c from S-space output variance antipasto-svd-cores wassname 2026-06-17 18:18:32 +08:00
  • 6cb350a4b6 README: fill IA3-FF row (56.3/62.0, 86k params, 0 added MACs) wassname 2026-06-17 15:49:02 +08:00
  • 4962bffd7d README: fill EVA + IA3 baseline rows wassname 2026-06-17 15:26:50 +08:00
  • 7e024b4734 comment hygiene + HRA row: shorten docstrings, drop dead init branch, track asvd wassname 2026-06-17 11:16:07 +08:00
  • 5f9d90d8b8 benchmark sweep: rot(U/both) ablation, whitening conclusion, cost rows wassname 2026-06-17 06:17:53 +08:00
  • 7986edad2c fix: calibration through cropped model + detach/checkpoint gaps (external review) wassname 2026-06-16 06:37:18 +08:00
  • d4ec550dd8 fix: corda silently ran as plain SVD; wire calibration + persist data-driven residual wassname 2026-06-16 05:56:02 +08:00
  • 9d027752ad variants: replace arrow's dense block with diagonal-plus-low-rank core wassname 2026-06-15 20:13:15 +08:00
  • 2c56196dea justfile/run_id: r override for low-rank antipasto sweeps wassname 2026-06-15 18:35:54 +08:00
  • e8ca6f5944 README: validation framing per wassname's wording; arrow large-block lr=1e-4 wassname 2026-06-15 18:27:33 +08:00
  • 6b7b3a47dd README: frame the GSM8K table as a validation harness, not a leaderboard wassname 2026-06-15 18:20:53 +08:00
  • 6ab1dfff0e README: antipasto variants as table rows; real PEFT reference wassname 2026-06-15 18:18:09 +08:00
  • fa69e0cac3 README: trim AntiPaSTO section for researcher audience wassname 2026-06-15 18:12:31 +08:00
  • d9d31a160f variants: clean docstrings to research pseudocode; arrow block param wassname 2026-06-15 18:09:53 +08:00
  • 3af2a2a66a Update antipasto.py wassname (Michael J Clark) 2026-06-15 15:41:38 +08:00
  • 90b5199ed9 README: AntiPaSTO family GSM8K results (5 variants, r=256) wassname 2026-06-15 07:05:45 +08:00
  • a5999bdeb8 docs: tighten suppress_only contract + arrow top-b selection note wassname 2026-06-15 06:24:23 +08:00
  • 32b1fd885a justfile: route antipasto bench through r=256/alpha=256 in bench-variant wassname 2026-06-15 05:58:34 +08:00
  • d6b242818a justfile: lr=5e-3 for all antipasto_* cores in bench-variant wassname 2026-06-14 19:20:35 +08:00
  • 0d40cc9b38 Add antipasto_arrow: structured fixed-basis core (cross-direction mixing) wassname 2026-06-14 19:18:59 +08:00
  • b80d7778af Add rotation-free S-space adapter cores (antipasto family) wassname 2026-06-14 19:12:27 +08:00
  • e5048fcaff Update antipasto.py wassname (Michael J Clark) 2026-06-10 15:55:14 +08:00
  • 0dcbc753ac Update antipasto.py wassname (Michael J Clark) 2026-06-10 15:54:49 +08:00
  • 072a816cee docs: fix hallucinated arxiv links in variants table wassname 2026-05-26 05:48:49 +08:00
  • ce8c250422 perf: use matmul for lora adapter projections wassname 2026-05-21 08:23:56 +08:00
  • 56937e1b18 remove dead code: _road_matrix, callable(m) clause, silent git fallback wassname 2026-05-19 19:11:32 +08:00
  • 19888fbb82 antipasto: replace EVA-style group_init with Wanda-style dimension selection wassname 2026-05-01 21:24:52 +08:00
  • f91c7b23f2 antipasto: add EVA-style data-driven group_init wassname 2026-05-01 20:55:56 +08:00
  • b698331cfa feat: add HRA benchmark result (61.6%), update README table wassname 2026-04-27 20:07:19 +08:00
  • f6fd410677 benchmark: antipasto rotate_basis CLI + lr=5e-3 + ablation queue wassname 2026-04-27 16:29:25 +08:00
  • 88f107a423 antipasto: delta_s init 4e-4+N(0,4e-4) from antipasto3, rotate_basis='none' option wassname 2026-04-27 16:27:12 +08:00
  • 7df786e80b remove base_weight_fingerprint and test_lora_lite.py wassname 2026-04-27 16:15:40 +08:00
  • e624cd244f feat: near_zero/near_one init for trainable params (breaks bf16 dead-grad symmetry) wassname 2026-04-27 15:55:05 +08:00
  • 0bd091fe5b tidy wassname 2026-04-27 11:44:40 +08:00
  • a342801807 wip wassname 2026-04-27 11:24:19 +08:00
  • 24ba8deb02 simpler test wassname 2026-04-27 09:47:07 +08:00
  • b60a8c3f9b readme wassname 2026-04-27 09:46:52 +08:00
  • 727ef6ea73 tidy tests to subset of metamath wassname 2026-04-27 09:20:07 +08:00
  • 1a93df10b2 fixes wassname 2026-04-27 07:46:10 +08:00
  • bb8887e66c tidy wassname 2026-04-27 07:12:56 +08:00
  • 74c374e741 tidy, review wassname 2026-04-27 07:03:24 +08:00
  • a44fc039af rm defensive docstr wassname 2026-04-27 06:39:18 +08:00
  • a81ed6ffaf misc wassname 2026-04-27 06:23:36 +08:00
  • b179771cc6 tyro and benchmark wassname 2026-04-27 06:23:30 +08:00
  • 67a6daf6aa fix: 5 V4 must-fix bugs (DeLoRA B-init, HRA forward order, EVA A trainable, AntiPaSTO refs, qwen probe) wassname 2026-04-26 20:57:24 +08:00
  • 053901e0ca types, review wassname 2026-04-26 20:35:38 +08:00
  • 55757e829d fix V3 review must-fixes: DoRA bias passthrough + EVA load path copilot 2026-04-26 19:50:48 +08:00
  • 185eb29c70 fix v2 review bugs + add EVA, AntiPaSTO copilot 2026-04-26 19:41:59 +08:00
  • fdb4c77d6c Add reference-impl URLs to variant docstrings + V2 external review wassname 2026-04-26 19:27:47 +08:00
  • d0b4c52740 External review: per-variant audit + design notes wassname 2026-04-26 19:01:29 +08:00
  • 7eeaeed206 Verify all variants on bnb 4bit/8bit; HRA paper-faithful rewrite wassname 2026-04-26 18:08:06 +08:00
  • 0d929f93b3 feat(hra): add Householder Reflection Adaptation, hook-only/bnb-friendly + Qwen proof wassname 2026-04-26 17:58:56 +08:00
  • 43e620176c docs: record DoRA + IA3 Qwen-0.6B proof results (tasks 80, 81) wassname 2026-04-26 17:54:54 +08:00
  • 2abf616be6 feat(dora): add weight-decomposed LoRA variant for fp layers wassname 2026-04-26 17:53:33 +08:00
  • 699fde31bf feat: ia3 variant, real bnb 4bit/8bit smoke, dev guide split, user-only readme wassname 2026-04-26 17:49:17 +08:00
  • f2d9021511 ci: add publishable check workflow wassname 2026-04-26 17:09:47 +08:00
  • 69bf5f4e44 test: prove adapter training paths wassname 2026-04-26 17:00:39 +08:00
  • 4db5cee5a9 init wassname 2026-04-26 14:10:20 +08:00
  • de97724b65 init wassname 2026-04-26 14:10:18 +08:00