mirror of
https://github.com/wassname/adapters_as_hypotheses.git
synced 2026-06-27 15:14:03 +08:00
3800481a30d98951dae0b9b63c6c1405fc784645
30 PEFT methods reframed as hypotheses about transformer geometry. Each entry: pseudocode, hypothesis, evidence, grade. All papers saved to docs/ (full text).
Adapters as Representational Hypotheses
What does each PEFT method believe about transformer internals?
Each adapter architecture encodes a structural claim about how to intervene in pretrained weights. When one outperforms another under controlled conditions (same model, same data, same parameter budget), the winner's assumptions are supported as a better description of the weight manifold.
This catalog reframes ~30 PEFT methods as hypotheses about transformer geometry, extracts pseudocode for each intervention, and grades the evidence.
Evidence hierarchy
| Grade | Meaning |
|---|---|
| * | Parameter-efficient (matches LoRA with fewer params) |
| ** | Beats LoRA on raw performance |
| ! | Beats full fine-tuning |
| !! | Data-efficient (few-shot, fast convergence) |
| !!! | Generalizes out-of-distribution |
Contents
- adapters_as_hypotheses.md -- the main catalog
- docs/ -- saved papers (full text, markdown)
Key findings
- SVD basis is the natural coordinate system. Methods that use the model's own SVD decomposition (PiSSA, SVFT, SSVD, AntiPaSTO) consistently outperform random-basis methods at the same parameter count.
- Orthogonal >> arbitrary. Orthogonal constraints (OFT, BOFT, HRA, AntiPaSTO) preserve semantic structure and improve OOD transfer, at the cost of limited magnitude changes.
- Direction and strength decouple. Methods that separate what to change from how much (DeLoRA, ROAD, AntiPaSTO) show better robustness and enable bidirectional steering.
- Low-rank is necessary but not sufficient. LoRA's rank bottleneck limits hard tasks; full-rank methods (RandLoRA, SHiRA) close the gap with full FT.
- Scaling alone goes far. IA3 and LN Tuning show that a surprising amount of adaptation is just reweighting existing features -- "gain control" over channels.
Related
- A Pragmatic Vision for Interpretability -- Nanda et al. 2025
- AntiPaSTO: Antiparallel Steering -- Clark 2025 (Appendix A.3 is the origin of this framing)
- HuggingFace PEFT -- reference implementations
License
Content is CC-BY-4.0. Papers in docs/ are fetched from arXiv for reference and remain under their original licenses.
Description
Each lora type adapter can tell us something about how to look at transformer internals, and they some with causal evidence
Languages
Markdown
100%