Dongyang Fan

I'm a PhD student at the Machine Learning and Optimization Lab at EPFL, supervised by Prof. Martin Jaggi. My name is pronounced Don-Young.

My research interests are:

  • LLMs: data efficiency, factuality enhancement and ethics.
  • Data Valuation and Data Markets: quantification of the impact of data, fair compensation of content generators.
  • Modular and Collaborative Machine Learning: Mixture of Experts, co-distillation, collaborator selection.
I am also happy to branch out into other research areas. If you would like to get in touch, do not hesitate to drop me an email!

Email  /  Google Scholar  /  Twitter  /  Github  /  LinkedIn


Research

URLs Help, Topics Guide: Understanding Metadata Utility in LLM Training
Dongyang Fan, Vinko Sabolčec, Martin Jaggi
preprint, 2025
arXiv

Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs
Dongyang Fan, Vinko Sabolčec, Matin Ansaripour, Ayush Kumar Tarun, Martin Jaggi, Antoine Bosselut, Imanol Schlag
arXiv, 2025
arXiv

From Fairness to Truthfulness: Rethinking Data Valuation Design
Dongyang Fan, Tyler J. Rotello, Sai Praneeth Karimireddy
ICLR Workshop Data Problems, 2025
arXiv

On-Device Collaborative Language Modeling via a Mixture of Generalists and Specialists
Dongyang Fan*, Bettina Messmer*, Nikita Doikov, Martin Jaggi
International Conference on Machine Learning (ICML), 2025
code / arXiv

Towards an empirical understanding of MoE design choices
Dongyang Fan*, Bettina Messmer*, Martin Jaggi
ICLR ME-FoMo Workshop, 2024
arXiv

Personalized Collaborative Fine-Tuning for On-Device Large Language Models
Nicolas Wagner, Dongyang Fan, Martin Jaggi
Conference on Language Modeling (COLM), 2024
code / arXiv

Ghost Noise for Regularizing Deep Neural Networks
Atli Kosson, Dongyang Fan, Martin Jaggi
Association for the Advancement of Artificial Intelligence (AAAI), 2024
arXiv

Collaborative Learning via Prediction Consensus
Dongyang Fan, Celestine Mendler-Dünner, Martin Jaggi
Conference on Neural Information Processing Systems (NeurIPS), 2023
code / arXiv / poster

Miscellaneous

In general, I enjoy art and culture. I am also an outdoorsy person: I hike, ski, and sail.

I paint scenes from my hiking trips. For example:

Figure 1
Figure 2

The source code of this website is from here.