Iโm a 5th year PhD candidate in Applied Mathematics, and my research is on mechanistic interpretability, primarily oriented towards AI safety. Iโm currently most excited about Stochastic Parameter Decomposition, Understanding Attention Heads via pattern dynamics, and a particular kind of model reverse engineering.
Some of my previous work in mechanistic interpretability was on understanding search in toy models trained on mazes. Iโve also worked on projects in game theory, computational neuroscience, and computer vision. For more on my research, see here.
Outside of research, I like hiking, rock climbing, and fossil hunting. I am big fan of old sci-fi novels, particularly of the works of Arthur C. Clarke. I also have a number of open source projects, which are tooling for either ML research, personal knowledge management systems, or other random things.
Contact me:
mivanits ๐ฆโโโโโ๐นโโโโโ mines ๐ฉโโโโโ๐ดโโโโโ๐นโโโโโ ๐ชโโโโโ๐ฉโโโโโ๐บโโโโ
or:
{anything_you_want} ๐ฆโโโโโ๐นโโโโโ miv ๐ฉโโโโโ๐ดโโโโโ๐น name
LinkedIn | GitHub | Google Scholar | ORCID