I am a Doctoral Researcher at the Helmholtz Munich and Technical University of Munich (TUM) at Dr. Vincent Fortuin's lab .
My research interests are in LLM trustworthiness, uncertainty estimation, and LLM efficiency. Currently, I study how LLMs can express uncertainty reliably in natural language, and methods for ensuring calibrated and verifiable uncertainty communication. I am also interested in probabilistic approaches to test-time inference for decision-making and efficient deployment. More broadly, I work on generative modeling and efficient LLMs.
Previously I graduated as an ELLIS Honours MSc in Artificial Intelligence at the University of Amsterdam, where I wrote my thesis on probabilistic deep learning methods for continual learning with Dr. Eric Nalisnick at the AMLab . I obtained my undergraduate degree in Physics with Mathematics minor at the Nazarbayev University. During my undergraduate, I was fortunate enough to receive the Yessenov Foundation scholarship to conduct a research visit at the Cornell University, where I worked on developing deep learning methods for inverse design for nanophotonic structres, under the supervision of Dr. Francesco Monticone . As an undergraduate, for two years I also was a Research Assistant at my university's Physics department, where I studied the nanophotonic structures and their optimization under the supervision of Dr. Constantinos Valagiannopoulos
Most recent publications on Google Scholar.
Gaussian stochastic weight averaging for Bayesian low-rank adaptation of large language models
Emre Onal, Klemens Floge, Emma Caldwell, Arsen Sheverdin, Vincent Fortuin
ApproximateThe 7th Symposium on Advances in Approximate Bayesian Inference (AABI), 2024
From Hyperbolic Geometry Back to Word Embeddings
Zhenisbek Assylbekov, Sultan Nurmukhamedov, Arsen Sheverdin, Thomas Mach
Proceedings of the 7th Workshop on Representation Learning for NLP, 39-45 (2022)
Reproducibility report for "Interpretable Complex-Valued Neural Networks for Privacy Protection"
Arsen Sheverdin, Noud Corten, Alko Knijff, Georg Lange
ML Reproducibility Challenge (2020)
Photonic inverse design with neural networks: the case of invisibility in the visible
Arsen Sheverdin, Francesco Monticone, Constantinos Valagiannopoulos
Physical Review Applied, 14(2): 024054 (2020)
Core-shell nanospheres under visible light: Optimal absorption, scattering, and cloaking
Arsen Sheverdin, Constantinos Valagiannopoulos
Physical Review B, 99(7): 075305 (2019)