Amir Hossein Kargaran

prof_pic.jpg

I’m a Computer Science Ph.D. student at Munich University, advised by Prof. Hinrich Schütze. I’m also affiliated as a junior member with the Munich Center for Machine Learning.

My PhD research focuses on multilingual NLP, specifically on scaling NLP technologies to include more languages:

  • GlotCC: CommonCrawl corpus for more than 1,000 languages.
  • MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment for many langauges.

Before starting my Ph.D., I completed my M.Sc. degree in Computer Engineering at Sharif University of Technology in 2022. I earned two B.Sc. degrees in Electrical Engineering and Computer Engineering from Isfahan University of Technology in 2020. Additionally, I was a research intern with the User Interfaces group at Aalto University during the summers of 2021 and 2022. I contributed to the development of the Aalto Interface Metrics (AIM) project, a service and codebase for computational GUI evaluation.

I’ll be on the job market in Fall 2025, exploring opportunities in research or applied NLP roles. My work focuses on large language models, multilingual NLP, agents, translation and the intersection of NLP and programming languages. If you think there’s a potential fit, I’d love to hear from you.

news

Jul 8, 2025 📌 FineWeb2 is accepted at COLM 2025.
May 23, 2025 🛠️ I’m organizing the MELT workshop (Twitter, OpenReview) at COLM 2025.
May 15, 2025 📌 MEXA and a paper on code language models are accepted at ACL Findings 2025.

see all the news here.