Iโ€™m a 5th year PhD candidate in Applied Mathematics, and my research is on mechanistic interpretability, primarily oriented towards AI safety. Iโ€™m currently most excited about Stochastic Parameter Decomposition, Understanding Attention Heads via pattern dynamics, and a particular kind of model reverse engineering.

Some of my previous work in mechanistic interpretability was on understanding search in toy models trained on mazes. Iโ€™ve also worked on projects in game theory, computational neuroscience, and computer vision. For more on my research, see here.

Outside of research, I like hiking, rock climbing, and fossil hunting. I am big fan of old sci-fi novels, particularly of the works of Arthur C. Clarke. I also have a number of open source projects, which are tooling for either ML research, personal knowledge management systems, or other random things.

Contact me:
mivanits ๐Ÿ‡ฆโ€‹โ€‹โ€‹โ€‹โ€‹๐Ÿ‡นโ€‹โ€‹โ€‹โ€‹โ€‹ mines ๐Ÿ‡ฉโ€‹โ€‹โ€‹โ€‹โ€‹๐Ÿ‡ดโ€‹โ€‹โ€‹โ€‹โ€‹๐Ÿ‡นโ€‹โ€‹โ€‹โ€‹โ€‹ ๐Ÿ‡ชโ€‹โ€‹โ€‹โ€‹โ€‹๐Ÿ‡ฉโ€‹โ€‹โ€‹โ€‹โ€‹๐Ÿ‡บโ€‹โ€‹โ€‹โ€‹
or:
{anything_you_want} ๐Ÿ‡ฆโ€‹โ€‹โ€‹โ€‹โ€‹๐Ÿ‡นโ€‹โ€‹โ€‹โ€‹โ€‹ miv ๐Ÿ‡ฉโ€‹โ€‹โ€‹โ€‹โ€‹๐Ÿ‡ดโ€‹โ€‹โ€‹โ€‹โ€‹๐Ÿ‡น name

LinkedIn | GitHub | Google Scholar | ORCID