Research
Contributions
Carnegie Mellon University, Tepper School of Business
Denes, Matthew and Lagaras, Spyridon and Tsoutsoura, Margarita, 2025,
Entrepreneurship and the Gig Economy: Evidence from U.S. Tax Returns. Journal of Financial Economics.
Denes, Matthew and Lagaras, Spyridon and Tsoutsoura, Margarita, 2025,
Evolution of the Relationship between the Gig Economy and Entrepreneurship:
The Heterogeneous Effects of Labor Market Disruptions. Working Paper.
Denes, Matthew and Lagaras, Spyridon and Tsoutsoura, Margarita, 2023,
First Come, First Served: The Timing of Government Support and Its Impact on Firms. Working Paper.
Denes, Matthew and Lagaras, Spyridon and Tsoutsoura, Margarita, n.d.,
Take-up of Flexible Labor. Working Paper.
Harvard Business School
Palmer, Jonathan and Zoeller, Collin, 2023,
The Impact of the Kodak Crash on Entrepreneurship in Rochester, NY.
Conferences
BYU President's Leadership Council
Zoeller, Collin (2023) "Growing together: Growing the Tree". May 2023. Brigham Young University, Provo, UT.
Presentation to the top investors and business leaders in the state of Utah who are members of the BYU President's Leadership Council
and direct funding to support BYU programs. This presentation summarized the research and progress made
in building automated indexing software for genealogy using AI and machine learning.
Family History Technology Workshop
Zoeller, Collin (2023) "City Directory Automated Indexing". April 2023. Brigham Young University, Provo, UT.
Projects and Packages
StataHelper
A Python package that simplifies the Pystata interface and performs parallelized Stata tasks.
This is especially useful for those who are not familiar with Stata's syntax or who are looking to
leverage Python's usability with Stata's functionality.
GitHub | PyPI
Explain
A Stata package that integrates Stata with almost any LLM to assist with code debugging, explanation, and improvements.
Supports both hosted and local models.
GitHub
Large-scale Efficient Text Classification with ANNOY + SLM
A project that demonstrates how to use the ANNOY library to build a large-scale multi-class text classification model
using SLM embeddings. Combining lightweight SLM embeddings with ANNOY, an approximate nearest neighbors
algorithm, proves to be a powerful and efficient way to classify large amounts of text data.
GitHub
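The idea behind this project can be illustrated with a minimal sketch. It is not the project's code: exact nearest-neighbor search stands in for the ANNOY index, and normalized word-count vectors stand in for SLM embeddings. All function names and the sample data are hypothetical and chosen only for illustration.

```python
import math
from collections import defaultdict

# Hypothetical labeled examples; a real project would have many more.
TRAIN = [
    ("regression output in stata", "stata"),
    ("stata do file error", "stata"),
    ("python pandas dataframe merge", "python"),
    ("python list comprehension error", "python"),
]


def build_vocab(texts):
    """Assign each distinct token a vector position."""
    vocab = {}
    for text in texts:
        for token in text.lower().split():
            vocab.setdefault(token, len(vocab))
    return vocab


def embed(text, vocab):
    """Toy stand-in for an SLM embedding: L2-normalized word counts."""
    vec = [0.0] * len(vocab)
    for token in text.lower().split():
        idx = vocab.get(token)
        if idx is not None:  # tokens outside the vocabulary are ignored
            vec[idx] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm else vec


def classify(query, index, vocab, k=3):
    """Label a query by a similarity-weighted vote of its k nearest neighbors.

    This scans every stored vector (exact search). An approximate index
    such as ANNOY would replace this scan when the data set is large.
    """
    q = embed(query, vocab)
    scored = sorted(
        ((sum(a * b for a, b in zip(q, vec)), label) for vec, label in index),
        reverse=True,
    )
    votes = defaultdict(float)
    for similarity, label in scored[:k]:
        votes[label] += similarity
    return max(votes, key=votes.get)


VOCAB = build_vocab(text for text, _ in TRAIN)
INDEX = [(embed(text, VOCAB), label) for text, label in TRAIN]

print(classify("merge a pandas dataframe", INDEX, VOCAB))
print(classify("stata regression error", INDEX, VOCAB))
```

Weighting votes by similarity keeps unrelated neighbors, whose similarity is zero, from outvoting a single close match when k is larger than the number of relevant examples.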
StataAgent (current project)
A specialized agent model that integrates with Stata for question-based data analysis. It takes natural-language questions
such as "What is the average income of individuals in 2022?" and queries Stata to return the answer.
GitHub
Medium and More
Zoeller, Collin (2024) "A Gentle Introduction to Quantum Computing." Medium.
Zoeller, Collin (2024) "Leveraging Stata with Python: Part 1." Medium.
Zoeller, Collin (2024) "Leveraging Stata with Python: Part 2." Medium.