🧠 Different Weight Geometries with Similar LLM Training Methods
Research by AlexWortega has shown that DPO, GRPO, and DAPO methods form a fundamentally different model weight structure compared to SFT, RFT, and DFT, even on identical data. This effect is stable and independent of hyperparameters.
🌍 This is critically important for developing transfer learning methods and assessing model reliability, as different training methods create different internal representations of knowledge.
👤 This explains why models with the same accuracy can behave differently in new tasks — their "internal map" of knowledge is constructed differently.
Source 1: https://huggingface.co/spaces/AlexWortega/same-data-different-losses Source 2: https://x.com/justALEXWORTEGA/status/2068790635570561429
