Email: aasingh4@usfca.edu | Website | Phone: (415)-745-6473
EDUCATION
Master of Science in Analytics (MSAN)
University of San Francisco | 2015-2016 (Pursuing)
Interests: Machine learning, time series, SQL & NoSQL databases, distributed computing
Python, R, SAS, Cassandra, MS SQL, PostgreSQL, Hive & Hadoop, MongoDB, Shiny studio, D3.js
Apache Drill, AWS (S3, EMR, EC2, RDS), H2O, PySpark (MLlib, Streaming & SQL), SSH, Git & Bash
Produced insights & visualization patterns to identify installation bottlenecks & faulty data patterns
Created an AWS-ML-based UI to help Sunrun predict & track project installation times
Performed stemming & stop-word removal on 100k movie reviews from the Stanford database
Fitted Naive Bayes classifiers in both Spark and Python, reaching 83% accuracy
RESTful Web Services on AWS (Tools used: Python, Flask, EC2, RDS)
Parsed JSON files (1.5 GB) on product sales and loaded them into AWS RDS
Analyzed the 5 biggest drivers of sales and hosted the results online (backed by an EC2 instance)
Crawled OMDb & Box Office Mojo to pull movie data for predicting movie success
Identified the 3 biggest drivers of revenue for movies released between 1980 and 2015
Audio Clip Classification (Tools used: IPython notebook, Python, OpenCV, R, H2O)
Processed audio files, derived features and transferred the data to the H2O environment
Fitted & tuned a gradient boosting classifier to reach a test AUC of 0.997
Employed missing-value substitution, outlier treatment & feature engineering for an insurer
Tuned a gradient boosting classifier to reach a Kaggle test AUC of 0.964