Publications related on Large Language Models:


  1. Hayden Helm, Carey E. Priebe, "When prompt perturbations break your A/B test: A valid statistical test for generative surveying," Submitted, 2026


  2. Hayden Helm, Carey E. Priebe, Brandon Duderstadt,, "Control Charts for Multi-agent Systems," Submitted, 2026


  3. Hayden Helm, Merrick Ohata, Carey E. Priebe, "Black-box model classification under the discriminative factorization," Submitted, 2026


  4. Hayden Helm, Ben Johnson,, Carey E. Priebe, "Query-efficient model evaluation using cached responses," Submitted, 2026


  5. Maximilian Baum, Aranyak Acharyya, Tianyi Chen, Avanti Athreya, Youngser Park, Francesco Sanna Passino, Carey E. Priebe, Zachary Lubberts, "A mathematical framework for parameter recovery in large language models via a joint Euclidean mirror," Submitted, 2026


  6. Michael Browder, Kevin Duh, J. David Harris, Vince Lyzinski, Paul McNamee, Youngser Park, Carey E. Priebe, Peter Viechnicki, "Data Kernel Perspective Space Performance Guarantees for Synthetic Data from Transformer Models," Submitted, 2026


  7. Aranyak Acharyya, Joshua Agterberg, Youngser Park, Carey E. Priebe, "Concentration bounds on response-based vector embeddings of black-box generative models," Submitted, 2025


  8. Lingyou Pang, Lei Huang, Jianyu Lin, Tianyu Wang, Akira Horiguchi, Alexander Aue, Carey E. Priebe, "Unsupervised Conformal Inference: Bootstrapping and Alignment to Control LLM Uncertainty," Submitted, 2025


  9. Aranyak Acharyya, Carey E. Priebe, Hayden S. Helm, "Testing for LLM response differences: the case of a composite null consisting of semantically irrelevant query perturbations," Submitted, 2025


  10. Aranyak Acharyya, Michael W. Trosset, Carey E. Priebe, Hayden S. Helm, "Consistent estimation of vector embeddings of black-box generative AI models," Joint Statistical Meetings, Nashville, TN, August 2 - August 7, 2025 (DARPA)


  11. Zekun Wang, Runbing Zheng, Youngser Park, Carey E. Priebe, "Representing a Collection of Large Language Models as a Gaussian Mixture," Joint Statistical Meetings, Nashville, TN, August 2 - August 7, 2025 (DARPA)


  12. Hayden Helm, Aranyak Acharyya, Brandon Duderstadt, Youngser Park, Carey E. Priebe, "Statistical inference on black-box generative models in the data kernel perspective space," The 63rd Annual Meeting of the Association for Computational Linguistics, Vienna, Austria, July 27-August 1st, 2025


  13. Tianyu Wang, Lingyou Pang, Akira Horiguchi, Carey E. Priebe, "LLM Web Dynamics: Tracing Model Collapse in a Network of LLMs," Submitted, 2025 (AFOSR, DARPA)


  14. Edward Wang, Tianyu Wang, Avanti Athreya, Vince Lyzinski, Carey E. Priebe, "Gaussian mixture models as a proxy for interacting language models," Submitted, 2025 (AFOSR)


  15. Hayden Helm, Tianyi Chen, Harvey McGuinness, Paige Lee, Brandon Duderstadt, Carey E. Priebe, "Toward a digital twin of U.S. Congress," Submitted, 2025


  16. Hayden Helm, Tianyi Chen, Harvey McGuiness, Paige Lee, Brandon Duderstadt, Carey E. Priebe, "Toward a digital twin of U.S. Congress," 2025 Symposium on Data Science & Statistics, Salt Lake City, Utah, April 29 - May 2, 2025


  17. Cencheng Shen, Darren Edge, Jonathan Larson, Carey E. Priebe, "Explaining Categorical Feature Interactions Using Graph Covariance and LLMs," Applied Network Science, accepted for publication, 2025


  18. Aranyak Acharyya, Michael W. Trosset, Carey E. Priebe, Hayden S. Helm, "Consistent estimation of generative model representations in the data kernel perspective space," Submitted, 2025


  19. Hayden Helm, Aranyak Acharyya, Brandon Duderstadt, Youngser Park, Carey E. Priebe, "Statistical inference on black-box generative models in the data kernel perspective space," Submitted, 2024


  20. Michael W. Trosset, Carey E. Priebe, "Continuous Multidimensional Scaling," Submitted, 2024


  21. Harvey McGuinness, Tianyu Wang, Carey E. Priebe, Hayden Helm, "Investigating social alignment via mirroring in a system of interacting language models," Submitted, 2024


  22. Robert Osazuwa Ness, Katie Matton, Hayden Helm, Sheng Zhang, Junaid Bajwa, Carey E. Priebe, Eric Horvitz, "MedFuzz: Exploring the Robustness of Large Language Models in Medical Question Answering," Submitted, 2024


  23. Hayden Helm, Carey E Priebe, Weiwei Yang, "A Statistical Turing Test for Generative Models," Submitted, 2023


  24. Brandon Duderstadt, Hayden S. Helm, Carey E Priebe, "Comparing Foundation Models using Data Kernels," Submitted, 2023


  25. Hayden Helm, Brandon Duderstadt, Youngser Park, Carey E. Priebe, "Tracking the Perspectives of Interacting Language Models," Conference on Empirical Methods in Natural Language Processing, November 12–16, Miami, Florida, 2024. publisher site

Last Modified: