-
Figuring Out Star Schema in Power BI/ June 19, 2025
After developing my first PowerBI project, I learn and implement preferred practices in semantic modeling and the star schema approach to dashboard development.
-
Cleaning and Labelling Data with LLMs/ June 3, 2025
My method of using GPT-4 to clean 1,229 sometimes awkward, abbreviated HS-4 commodity descriptions into clear, natural language descriptions.
-
Estimating LHRR Race Day Temperature/ May 2, 2025
Estimate the probability distribution of temperatures on a specific calendar day (June 9th) in a specific town (Litchfield, CT) at a specific time (1:00pm)
-
Creating the Neural Network Menu/ March 24, 2025
Reviewing how I created the interactive navigation menu you see on the left sidebar of this site. I learned about DOM, even listeners, and basic CSS/JavaScript tradeoffs.
-
Modeling Socioeconomic Ascent in Connecticut Census Tracts/ March 13, 2025
Using public Census data, I applied data mining algorithms to model the socioeconomic correlations of economic growth on the 884 official Census tracts of Connecticut to generate 5-year forecasts for the year 2028.
-
Using Principal Component Analysis to Produce a Composite Variable for Socioeconomic Analysis/ February 19, 2025
As part of my M.S. thesis, I used PCA to compress 3 variables into a single variable for use as my predictor variable. I discuss tradeoffs of PCA in the context of my specific problem case.
-
A Simple Notification System for Amazon Book Wishlist/ January 8, 2025
A Python-based notification system that monitors changes in the price or availability of books on your Amazon wishlist. Users receive email alerts when key changes occur, helping them stay updated on their desired books.
-
Quick Deployment of a Simple Machine Learning Model with Streamlit/ December 27, 2024
How to deploy a simple machine learning model using Streamlit, turning a trained model into an interactive web app.
-
Generalizing Regression: t-tests and OLS/ December 23, 2024
Exploring the idea that t-tests for comparing group means are mathematically equivalent to performing OLS regression with a binary predictor, highlighting the flexibility and power of regression analysis.
-
Discovering geosnap/ December 16, 2024
-
A B Testing Email Campaign Times/ December 9, 2024
-
Basic ETL Using U.S. Census Trade Data/ November 19, 2024
-
Notes on the Bias/Variance Tradeoff/ November 14, 2024
-
U.S. State Exports Dashboard in Power BI/ October 8, 2024
How I approaching building my first serious Power BI dashboard using U.S. international export data
-
Developing REST API in Python and Consuming it in a Flask Web Application/ August 7, 2024
-
Some notes on the SQL GROUP BY statement/ August 5, 2024
-
Implementing the Naive Bayes Classifier on the Iris Dataset/ August 2, 2024
-
Jacob Initial Post/ July 26, 2024