Arsen Sheverdin

PhD candidate in Machine Learning

arsen.sheverdin [AT] tum.de

About

I am a Doctoral Researcher at the Helmholtz Munich and Technical University of Munich (TUM) at Dr. Vincent Fortuin's lab .

My research interests are in LLM trustworthiness, uncertainty estimation, and LLM efficiency. Currently, I study how LLMs can express uncertainty reliably in natural language, and methods for ensuring calibrated and verifiable uncertainty communication. I am also interested in probabilistic approaches to test-time inference for decision-making and efficient deployment. More broadly, I work on generative modeling and efficient LLMs.

Bio

Previously I graduated as an ELLIS Honours MSc in Artificial Intelligence at the University of Amsterdam, where I wrote my thesis on probabilistic deep learning methods for continual learning with Dr. Eric Nalisnick at the AMLab . I obtained my undergraduate degree in Physics with Mathematics minor at the Nazarbayev University. During my undergraduate, I was fortunate enough to receive the Yessenov Foundation scholarship to conduct a research visit at the Cornell University, where I worked on developing deep learning methods for inverse design for nanophotonic structres, under the supervision of Dr. Francesco Monticone . As an undergraduate, for two years I also was a Research Assistant at my university's Physics department, where I studied the nanophotonic structures and their optimization under the supervision of Dr. Constantinos Valagiannopoulos

Publications

Most recent publications on Google Scholar.

Gaussian stochastic weight averaging for Bayesian low-rank adaptation of large language models

Emre Onal, Klemens Floge, Emma Caldwell, Arsen Sheverdin, Vincent Fortuin

ApproximateThe 7th Symposium on Advances in Approximate Bayesian Inference (AABI), 2024

From Hyperbolic Geometry Back to Word Embeddings

Zhenisbek Assylbekov, Sultan Nurmukhamedov, Arsen Sheverdin, Thomas Mach

Proceedings of the 7th Workshop on Representation Learning for NLP, 39-45 (2022)

Reproducibility report for "Interpretable Complex-Valued Neural Networks for Privacy Protection"

Arsen Sheverdin, Noud Corten, Alko Knijff, Georg Lange

ML Reproducibility Challenge (2020)

Photonic inverse design with neural networks: the case of invisibility in the visible

Arsen Sheverdin, Francesco Monticone, Constantinos Valagiannopoulos

Physical Review Applied, 14(2): 024054 (2020)

Core-shell nanospheres under visible light: Optimal absorption, scattering, and cloaking

Arsen Sheverdin, Constantinos Valagiannopoulos

Physical Review B, 99(7): 075305 (2019)