M.S.

Monitoring Latent World States in Language Models with Propositional Probes
Jiahai Feng [2025]

5th Year M.S.

Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaption
Danny Halawi [2024]