Files
llm-moral-foundations2/README.md
T
2025-08-21 21:17:43 +08:00

363 B

Unbiased Assessment of LLM Moral Foundations: Controlling for Positional Effects and Response Steering

Difference from previous work

  • control for positional bias
  • use mechinterp representation steering

Links:

TODO