Faculty Publications - Nika Haghtalab
Masters Reports
- D. Halawi, A. Wei, E. Wallace, T. Wang, N. Haghtalab, and J. Steinhardt, "Covert Malicious Finetuning: Challenges in Safeguarding LLM Adaptation," EECS Department, University of California, Berkeley, Tech. Rep. UCB/EECS-2024-216, Dec. 2024.