Here I'll evaluate the new Gemini 2.0 Flash Thinking model against the MATH Vision dataset using the LLM-as-a-Judge technique. I'll show that this new model has impressive state-of-the-art accuracy and just might be the best-scoring model available right now.
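For context, the grading loop itself is only a few lines. Here's a minimal sketch of the LLM-as-a-Judge pattern; the judge model, API key handling, and prompt wording are illustrative assumptions, not the exact setup from the post:

```python
# Minimal LLM-as-a-Judge sketch (model name and prompt are assumptions).
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")  # placeholder credential
judge = genai.GenerativeModel("gemini-1.5-pro")  # assumed judge model

def judge_answer(question: str, ground_truth: str, model_answer: str) -> bool:
    """Ask the judge LLM whether the candidate answer matches the ground truth."""
    prompt = (
        "You are grading a math answer. Reply with exactly CORRECT or INCORRECT.\n"
        f"Question: {question}\n"
        f"Ground truth answer: {ground_truth}\n"
        f"Candidate answer: {model_answer}"
    )
    response = judge.generate_content(prompt)
    return response.text.strip().upper().startswith("CORRECT")
```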
Here's a short tutorial on estimating the causal effect of the recent Maui wildfires on the local unemployment rate using data from the Bureau of Labor Statistics (BLS). Rather than measure correlation or association, I'll dive into causality using the Python implementation of the CausalImpact library, and use the Dynamic Time Warping algorithm from tslearn to choose the most similar counties to measure against as controls (a form of Market Matching).
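In miniature, the matching-plus-inference loop looks something like this. The column names, number of controls, and exact date windows below are assumptions for illustration:

```python
# Sketch: pick control counties by DTW similarity, then fit CausalImpact.
# Assumes a DataFrame `df` of monthly unemployment rates, one column per county,
# indexed by date (hypothetical data layout).
from tslearn.metrics import dtw
from causalimpact import CausalImpact

target = "Maui"  # hypothetical column name
candidates = [c for c in df.columns if c != target]

# Rank candidate counties by DTW distance over the pre-wildfire window.
pre = df.loc[:"2023-07-31"]
distances = {c: dtw(pre[target].values, pre[c].values) for c in candidates}
controls = sorted(distances, key=distances.get)[:3]  # 3 closest matches

# CausalImpact expects the response in the first column, controls after it.
data = df[[target] + controls]
ci = CausalImpact(data, ["2020-01-01", "2023-07-31"], ["2023-08-01", "2024-03-31"])
print(ci.summary())
```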
Here's a short tutorial on how you can set up an LLM on GCP to talk to your BigQuery data through Vertex AI using the PaLM 2 LLM... otherwise known as Table Q&A. I'll store a sample HR Attrition dataset from IBM in BigQuery and then set up an LLM so we can chat with the data. We'll be able to ask it simple questions and validate its answers, all in only a few lines of code.
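At its simplest, the pattern is: ask the LLM to write SQL, then run that SQL against BigQuery. A minimal sketch follows; the project and table IDs are placeholders, and note that PaLM 2's text-bison model has since been deprecated in favor of Gemini:

```python
# Sketch: natural-language question -> SQL via PaLM 2 -> run against BigQuery.
import vertexai
from vertexai.language_models import TextGenerationModel
from google.cloud import bigquery

vertexai.init(project="my-gcp-project", location="us-central1")  # placeholders
llm = TextGenerationModel.from_pretrained("text-bison")
bq = bigquery.Client()

TABLE = "my-gcp-project.hr.ibm_attrition"  # hypothetical table id

def ask(question: str):
    prompt = (
        f"Write a BigQuery Standard SQL query against `{TABLE}` that answers:\n"
        f"{question}\nReturn only the SQL."
    )
    sql = llm.predict(prompt, temperature=0).text
    return bq.query(sql).to_dataframe()

print(ask("What is the overall attrition rate?"))
```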
Here is a comprehensive end-to-end project where I analyze survey data to find the measurable company metrics most relevant to the survey, to use as proxy metrics for measurement throughout the year. The survey responses are also fed into Google's Gemini model for sentiment analysis and topic summarization, at scale.
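The per-response scoring step looks roughly like this. A hedged sketch: the model name, prompt, and `survey_comments` list are assumptions for illustration:

```python
# Sketch: classify sentiment and summarize topic for each survey comment.
import google.generativeai as genai

genai.configure(api_key="YOUR_API_KEY")  # placeholder credential
model = genai.GenerativeModel("gemini-1.5-flash")  # assumed model name

def analyze(comment: str) -> str:
    prompt = (
        "Classify the sentiment of this employee survey comment as "
        "POSITIVE, NEUTRAL, or NEGATIVE, then summarize its topic in 5 words.\n"
        f"Comment: {comment}"
    )
    return model.generate_content(prompt).text

# survey_comments is a hypothetical list of free-text responses.
results = [analyze(c) for c in survey_comments]
```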
This will be Part 1 of a tutorial on how to create a simple Flask web app, which will ultimately help a user create a playlist on their Spotify account containing the most popular songs from artists playing in their area in the upcoming months. Part 1 will set up a simple ETL data process on GCP, focusing on pulling data from the APIs of both Spotify and SeatGeek, combining the data, and then uploading/automating the process through GCP using App Engine, Cloud Scheduler, Cloud Storage, and Secret Manager.
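To give a feel for the extract step, here's a rough sketch of pulling upcoming concerts from SeatGeek and matching them to artists on Spotify. The city filter and credential names are placeholders; in the real pipeline the credentials come from Secret Manager:

```python
# Sketch of the extract step: local events from SeatGeek, top tracks from Spotify.
import requests
import spotipy
from spotipy.oauth2 import SpotifyClientCredentials

sp = spotipy.Spotify(auth_manager=SpotifyClientCredentials(
    client_id="SPOTIFY_ID", client_secret="SPOTIFY_SECRET"))  # placeholders

events = requests.get(
    "https://api.seatgeek.com/2/events",
    params={"client_id": "SEATGEEK_ID",        # placeholder credential
            "venue.city": "Honolulu",          # assumed city filter
            "taxonomies.name": "concert"},
).json()["events"]

for event in events:
    artist = event["performers"][0]["name"]
    hits = sp.search(q=artist, type="artist", limit=1)["artists"]["items"]
    if hits:
        top = sp.artist_top_tracks(hits[0]["id"])["tracks"]
        print(artist, [t["name"] for t in top[:3]])
```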
An implementation of Student's Paired t-Test for Means from end to end. This test is the appropriate test for comparing the means of one group sampled twice (once before and once after an intervention) with small-ish to large sample sizes in an A/B Testing scenario.
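With scipy this is a one-liner once you have the paired samples. A quick sketch on synthetic data:

```python
# Paired t-test: the same 30 users measured before and after an intervention.
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
before = rng.normal(10.0, 2.0, size=30)
after = before + rng.normal(0.5, 1.0, size=30)  # intervention adds ~0.5 on average

t_stat, p_value = stats.ttest_rel(after, before)
print(f"t = {t_stat:.3f}, p = {p_value:.4f}")
```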
An implementation of Student's Unpaired t-Test for Means from end to end. This test is the appropriate test for comparing the means between 2 independent but similar groups with small-ish to large sample sizes in an A/B Testing scenario.
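Again a one-liner in scipy, this time on two independent samples. A quick sketch on synthetic data (the default assumes equal variances, which is the Student's version; `equal_var=False` would give Welch's):

```python
# Unpaired (two-sample) Student's t-test on two independent groups.
import numpy as np
from scipy import stats

rng = np.random.default_rng(7)
control = rng.normal(10.0, 2.0, size=40)
treatment = rng.normal(10.8, 2.0, size=40)

t_stat, p_value = stats.ttest_ind(treatment, control)  # equal_var=True by default
print(f"t = {t_stat:.3f}, p = {p_value:.4f}")
```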
An implementation of the Z-Test for Proportions from end to end. This test is the appropriate test for comparing the proportion of binary data between 2 independent groups with large (and possibly different) sample sizes in an A/B Testing scenario.
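statsmodels makes this straightforward; a quick sketch with illustrative counts:

```python
# Two-sample z-test for proportions: conversions in two groups of different sizes.
from statsmodels.stats.proportion import proportions_ztest

successes = [230, 198]   # conversions in variant A and B (illustrative numbers)
nobs = [2000, 1500]      # large, different sample sizes

z_stat, p_value = proportions_ztest(count=successes, nobs=nobs)
print(f"z = {z_stat:.3f}, p = {p_value:.4f}")
```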
An implementation of Fisher's Exact Test for Proportions from end to end. This test is the appropriate test for comparing the proportion of categorical data between 2 independent groups with small sample sizes in an A/B Testing scenario. I'll also go over Barnard's and Boschloo's Exact tests which are both considered improvements to Fisher's test.
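All three exact tests live in scipy (Barnard's and Boschloo's require scipy >= 1.7). A quick sketch on an illustrative 2x2 table:

```python
# Exact tests on a small 2x2 contingency table.
from scipy.stats import fisher_exact, barnard_exact, boschloo_exact

# rows: group A / group B, columns: success / failure (illustrative counts)
table = [[7, 3],
         [2, 8]]

odds_ratio, p_fisher = fisher_exact(table)
print(f"Fisher   p = {p_fisher:.4f}")
print(f"Barnard  p = {barnard_exact(table).pvalue:.4f}")
print(f"Boschloo p = {boschloo_exact(table).pvalue:.4f}")
```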
An implementation of the Binomial Test for Proportions from end to end. This test is the appropriate exact test for comparing an observed proportion of binary data against a baseline proportion (for example, the rate seen in an independent control group of a different size) in an A/B Testing scenario.
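A quick sketch with scipy's exact binomial test; the counts and baseline rate are illustrative:

```python
# Exact binomial test: observed conversions vs. a baseline rate.
from scipy.stats import binomtest

k, n = 14, 80          # conversions and trials in the variant (illustrative)
p_baseline = 0.10      # e.g., the control group's known conversion rate

result = binomtest(k, n, p=p_baseline, alternative="greater")
print(f"p = {result.pvalue:.4f}")
```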
Here's a walkthrough of 4 different flavors of Bayesian regression with inference, each built around a separate case study or scenario using synthetic data. This might be interesting for someone who is familiar with the concept of regression and has always wondered what the fuss is with Bayesian statistics. You'll see that while it might require the use of pymc, a library for Bayesian computation, the structure is very similar to the Frequentist approach. You might even find that inference with Bayesian statistics is more flexible and more insightful.
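To make the "very similar structure" claim concrete, here's a minimal Bayesian linear regression in PyMC on synthetic data; the priors and sample sizes are illustrative:

```python
# Minimal Bayesian linear regression sketch in PyMC.
import numpy as np
import pymc as pm
import arviz as az

rng = np.random.default_rng(0)
x = rng.normal(size=100)
y = 2.0 + 1.5 * x + rng.normal(0, 0.5, size=100)  # known true coefficients

with pm.Model():
    intercept = pm.Normal("intercept", mu=0, sigma=10)
    slope = pm.Normal("slope", mu=0, sigma=10)
    sigma = pm.HalfNormal("sigma", sigma=1)
    pm.Normal("y_obs", mu=intercept + slope * x, sigma=sigma, observed=y)
    idata = pm.sample(1000, tune=1000)  # draw from the posterior

print(az.summary(idata))  # credible intervals instead of point estimates
```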
For this product Data Science project I'll explore the use of Bayesian Inference in A/B testing using the PyMC3 library. Using synthetic data, the idea behind the project will be to test 4 new playlist algorithms against the current algorithm. The metrics will focus on user interaction during the first selected song: the skip rate and the average time it took a user to skip the song.
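For one variant, the skip-rate comparison boils down to two Beta-Binomial models and a posterior difference. A minimal sketch in PyMC3-era syntax with illustrative counts (the full project also models time-to-skip):

```python
# Bayesian A/B sketch: does the variant lower the skip rate?
import pymc3 as pm

skips_control, n_control = 450, 1000   # illustrative counts
skips_variant, n_variant = 400, 1000

with pm.Model():
    p_control = pm.Beta("p_control", alpha=1, beta=1)   # flat priors
    p_variant = pm.Beta("p_variant", alpha=1, beta=1)
    pm.Binomial("obs_c", n=n_control, p=p_control, observed=skips_control)
    pm.Binomial("obs_v", n=n_variant, p=p_variant, observed=skips_variant)
    uplift = pm.Deterministic("uplift", p_control - p_variant)
    trace = pm.sample(2000, tune=1000, return_inferencedata=True)

# Posterior probability that the variant actually lowers the skip rate.
print(float((trace.posterior["uplift"] > 0).mean()))
```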
In this project I'll attempt to forecast Hawai'i Median Home Prices with the prophet library, and explore some intermediate features while doing so. I'll take a look at seasonality, changepoints, growth modes, anomaly omission, and prior scales in order to find a plausibly accurate home price forecast. And while this would typically be fairly straightforward, we'll see that the pandemic has given us some volatility that needs to be accounted for in order to find a nice fitting model.
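For a flavor of the knobs involved, here's a sketch of a Prophet fit using a couple of the levers mentioned above. The specific values are illustrative, not tuned, and `df` stands in for a frame with Prophet's expected `ds`/`y` columns:

```python
# Sketch: Prophet fit with a looser trend to absorb pandemic-era volatility.
from prophet import Prophet

m = Prophet(
    changepoint_prior_scale=0.5,   # more flexible trend (illustrative value)
    seasonality_prior_scale=5.0,   # dampened seasonality (illustrative value)
    yearly_seasonality=True,
)
m.fit(df)  # df: columns ds (date) and y (median home price)

future = m.make_future_dataframe(periods=24, freq="MS")  # 24 months ahead
forecast = m.predict(future)
m.plot(forecast)
```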
This will be a very short project where I'll forecast CO2 emissions recorded on top of Mauna Loa on the Big Island of Hawai'i using the prophet library in Python. While it won't be the most complex trend, I mostly wanted to forecast this data having lived on the Big Island for a handful of years, where I even walked right up to the lava flow a few times... I couldn't pass up the opportunity. Plus prophet is just so easy to set up and use; even tuning it is fairly straightforward. I'll probably circle back and do a more complicated forecast later on, but for now let's holo holo!
Some projects from my grad school days. Also, back when Twitter was Twitter!
These are kind of outdated. I'm pretty sure this was in the pre-GPT-2 days, before LLMs were a thing, or at least around that time. But I'm fairly confident I was the first person to publish an article on how to access the Spotify API for podcast data, as they had just released that endpoint.
These are pretty old! There are probably many modeling mistakes, programming bad practices, etc. I wouldn't follow these too closely, but I'm keeping them here because I still think they're interesting and they show growth.