I'm a data scientist, engineer, and aspiring software developer.
I love to build things - that's why I became an engineer! Along the way, I fell in love with electronics and programming, despite classically training as a Chemical Engineer, because it was so easy to rapidly iterate on ideas and see them come to life in real-time. Throughout my 9+ years in industry I've been fortunate enough to partner with some amazing people in supply chain, marketing, E-Commerce, and payments! When I'm not working as an analytics manager at Visa, I'm building things, traveling, gardening, and learning a new instrument!
Pre-trained models that over-rely on popular annotated datasets, like SNLI and MNLI, can learn dataset specific “Artifacts”. This hinders model generalization and leads to poor performance when deployed to production.

CNN (Convolutional Neural Networks) are a type of feed-forward deep neural network that leverages convolution layers for visual processing. In this project, I leverage a low-level ‘auto-pilot’ controller to train an AI vision based driving system to complete a course of SuperTuxKart.

“Big Data” no longer requires a complicated cluster of machines and a PhD. We can use Docker to run a local analytics stack! I’ve used this stack to train teams at Amazon/WFM on the internals of PySpark and Docker.

Most business stakeholders err away from using CLIs (Command Line Interfaces) or from writing complicated SQL. Instead, I designed and built an application that accepts input, queries a MS SQL Server Database, and returns results for master data management teams to maintain tables in SAP.

SARIMA (Seasonal Auto Regressive Integrated Moving Average) is a widely used model that brings components of different Time Series forecasting methods to create a more robust way to forecast. We can use SARIMA to predict the “one-year-in-advance” prediction of temperature for January 2018.

Kalman Filters have many applications in aerospace, navigation, econometrics, etc. and are often applied in time series analysis for signal processing. We are able to estimate a joint probability distribution over the variables by taking the uncertainty into account.