Worked with an electronics company to predict HDD failure using historical data accumulated over two years from 120,000 HDDs. Performed feature reduction through t-tests and imbalanced data correction using SMOTE. Coded in R and Nysol.
2019-2 - 2019-3
Factory Process Failure Prediction
Worked with an automotive company to build a model that could distinguish between good and bad parts cut out of sheet metal, using data collected from factory machine sensors. Performed complicated data shifting based on factory line speed and distance. Coded in R.
2018-11 - 2019-1
Machinery Parts Association Analysis
Worked with a farming equipment company to perform Market Basket Analysis on data of purchases of their equipment parts. Created a sample process in RapidMiner and documentation explaining the process and results interpretation.
2018-10 - 2018-11
Correlation Analysis and Factory Process Quality Measurements Prediction
Worked with an electronics company to assesses the usability of their data in a parameter adjustment algorithm. Performed a correlation analysis to establish which machine parameters were strongly correlated with the quality measurements and developed models that used the machine parameters to predict the quality measurements. Coded in R.
2018-7 - 2018-9
New Store Sales Prediction
Worked with a marketing company to build a two-stage regression model using Python that could predict a new stores customer count in its first year based on past data from existing stores and population statistics. Achieved greater than 70% accuracy.