Research



Contributions

Carnegie Mellon University, Tepper School of Business

Denes, Matthew and Lagaras, Spyridon and Tsoutsoura, Margarita, 2025, Entrepreneurship and the Gig Economy: Evidence from U.S. Tax Returns. Journal of Financial Economics Denes, Matthew and Lagaras, Spyridon and Tsoutsoura, Margarita, 2025, Evolution of the Relationship between the Gig Economy and Entrepreneurship: The Heterogeneous Effects of Labor Market Disruptions. Working Paper Denes, Matthew and Lagaras, Spyridon and Tsoutsoura, Margarita, 2023, First Come, First Served: The Timing of Government Support and Its Impact on Firms. Working Paper. Denes, Matthew and Lagaras, Spyridon and Tsoutsoura, Margarita, n.d., Take-up of Flexible Labor. Working Paper.

Harvard Business School

Palmer, Jonathan and Zoeller, Collin, 2023, The Impact of the Kodak Crash on Entrepreneurship in Rochester, NY.


Conferences

BYU President's Leadership Council

Zoeller Collin (2023) "Growing together: Growing the Tree" May 2023. Brigham Young University, Provo, UT. Presentation to the top investors and business leaders in the state of Utah who are members of the BYU President's Leadership Council. and direct funding to support BYU programs. This presentation was a summary of the research and progress made in building automated indexing software for geneaology using AI and machine learning.

Family History Technology Workshop

Zoeller, Collin (2023) "City Directory Automated Indexing". April 2023. Brigham Young University, Provo, UT.


Projects and Packages

StataHelper

A Python package that simplifies the Pystata interface and performs parallelized Stata tasks. This is especially useful for those who are not familiar with Stata's syntax or who are looking to leverage Python's usability with Stata's functionality.
GitHub | PyPI

Explain

A Stata package that integrates Stata with almost any LLM to assist with code debugging, explanation, and improvements. Supports both hosted and local models.
GitHub

Large-scale Efficient Text Classification with ANNOY + SLM

A project that demonstrates how to use the ANNOY library to build a large-scale multi-class text classification model using SLM embeddings. The combination of lightweight SLM embeddings and the ANNOY (type of Approximate Nearest Neighbors) algorithm proves a powerful and efficient way to classify large amounts of text data.
Github

StataAgent (current project)

A specialized agent model that integrates with Stata for question-based data analysis. Inputs natural language questions like "What is the average income of individuals in 2022?" and queries Stata to return the answer.
GitHub


Medium and More

Zoeller, Collin (2024) "A Gentle Introduction to Quantum Computing." Medium.
Zoeller, Collin (2024) "Leveraging Stata with Python: Part 1." Medium.
Zoeller, Collin (2024) "Leveraging Stata with Python: Part 2." Medium.