DE / EN

Current projects

Group of sudents attending a lecture
Speeding up Social Science

The scope of this research cooperation between the groups of Prof. Ratkovic, Prof. Gschwend and Springer Nature is to investigate how AI may support researchers in writing scientific articles.
The kick-off event for this project was a very successful hackaton in September 2024 – please find the news coverage for this event in our News tab.

Viral meme of distracted boyfriend with a twist on social science
Large Language Models for Statistical and Causal Inference

We explore how LLMs can serve as practical tools for statistical and causal inference. Our innovation, context augmentation, bridges the gap between text and inferential questions by leveraging model-generated contexts. This approach allows classical methods like the two-sample test, regression analysis, and causal inference. More broadly, we are developing powerful new tools for extracting insights from the unstructured complexity of language.

A picture of server racks.
GPU Servers for Research Purposes in Social Sciences

We are developing a GPU server system for research at the Chair.  The system is commercial-grade, with access via a friendly web interface or CLI.  The server is tailored for social science research, with pre-configured methods for working with large text, image, video, and multi-modal data.  The work is done in conjunction with KoehnAI.

Publications

Invited Talks

  • “Using LLMs for Regression and ANOVA: Statistical Inference and Testing of Text Data with Large Language Models”. With Haoyu Zhai. 7th International and Interdisciplinary Conference on the Quantitative and Computational Analysis of Textual Data (COMPTEXT), Vienna / Austria. (April 26, 2025)
  • “Large Language Models for Statistical Inference: Context Augmentation with Applications to the Two-Sample Problem and Regression.” Polmeth Europe 2025, London / UK. (April 7, 2025)
  • Invited talk at the Aix-Marseille School of Economics, Marseille / France. (March 24, 2025)
  • Invited speaker at the  Panel discussion on the US elections organized by the student representatives of the Departments of Sociology and Political Science of the University of Mannheim, Mannheim / Germany (October 22, 2024).
  • Invited speaker at the plenary panel Community insights on AI readiness. STM Annual Conference 2024 – Advancing trusted research in the AI era, Frankfurt a.M. / Germany. (October 14, 2024)
  • “Context Augmentation for the Statistical Analysis of Text.” With Haoyu Zhai. 6th International and Interdisciplinary Conference on the Quantitative and Computational Analysis of Textual Data (COMPTEXT), Amsterdam / Netherlands. (May 4, 2024)
  • “Estimation and Inference on Nonlinear and Heterogeneous Effects.” International Conference on Machine Learning (ICML). With Dustin Tingley. Virtual conference. (September 10, 2021)
  • “Rehabilitating the Regression: Honest and Valid Causal Inference through Machine Learning.” University of Michigan ISQM Seminar, Ann Arbor, MI / U.S.A. (November 4, 2019)
  • “Rehabilitating the Regression: Honest and Valid Causal Inference through Machine Learning.” New York University Methods Seminar, New York, NY / U.S.A. (October 28, 2019)
  • “Rehabilitating the Regression: Honest and Valid Causal Inference through Machine Learning.” Duke University Methods Seminar, Durham, NC / U.S.A. (May 13, 2019)
  • “The Effects of Political Institutions on the Extensive and Intensive Margins of Trade.” V-Dem Institute, Gothenburg / Sweden. (May 20, 2018)
  • “Causal Inference through the Method of Direct Estimation.” Ulric B. and Evelyn L. Bray Social Science Seminar Series, California Institute of Technology, Pasadena, CA / U.S.A. (February 26, 2018)
  • “Causal Inference through the Method of Direct Estimation.” Yale Methods Seminar, Yale University, New Haven, CT / U.S.A. (March 8, 2017)
  • “Causal Inference through the Method of Direct Estimation.” IQSS, Harvard University, Cambridge, MA / U.S.A. (April 27, 2017)
  • “What If? Machine Learning for Causal Inference.” Harvard University and MIT, Cambridge, MA / U.S.A. (February 2016)
  • “A Sparse Decomposition of Democracy and Trade”. Panel discussion, Annual Summer Meeting, Society for Political Methodology, University of Georgia, Athens, GA / U.S.A. (July 26, 2014)
  • Summer School in Quantitative Methods, Empirical Political Economy Network. CIDE, Mexico City / Mexico. (May 19–20, 2014)
  • “Covariate Balancing Propensity Score.” Panel discussion, Annual Meeting of the Society for Political Methodology. University of North Carolina-Chapel Hill, Chapel Hill, NC / U.S.A. (July 19, 2012)
  • Machine Learning Seminar. Department of Computer Science, Princeton University, Princeton, NJ / U.S.A. (March 14, 2012)
  • “Achieving Optimal Covariate Balance Under General Treamtent Regimes.” Thomas R. Ten Have Memorial Presentation. Atlantic Causal Inference Conference, Johns Hopkins University, Baltimore, MD / U.S.A. (May 25, 2012)
  • Experiments Seminar. Institution for Social and Policy Studies, Yale University, New Haven, CT / U.S.A. (September 8, 2011)
  • “Achieving Optimal Covariate Balance Under General Treatment Regimes.” Annual Meeting of the Society for Political Methodology. Princeton University, Princeton, NJ / U.S.A. (July 29, 2011)
  • Annual Meeting of the Midwest Political Science Association. Chicago, IL / U.S.A. (April 1, 2011)
  • “Identifying the Effects of Political Boundaries: Simultaneous Variable Selection and Curve Fitting through Mixed-Penalty Regularization.” New Faces in Political Methodology Conference. Quantitative Social Science Initiative, Pennsylvania State University, University Park, PA /U.S.A. (May 1, 2010)