• Machine Learning, Big Data, Distance Metrics, K Nearest Neighbors, Clustering, Decision Trees, Ensemble Methods, Dimensionality Reduction, Pipeline Building, Hyperparameter Tuning, Grid Search, and Scikit-Learn
• Deep Learning, Natural Language Processing, Neural Networks, Convolutional Neural Networks, Ngrams, POS Tagging, Text Vectorization, Context-Free Grammars, Neural Language Toolkit, Regular Expressions, Word2Vec, and Text Classification, Tableau, Hive, SparkData Engineering, Data Cleaning, Pandas, NumPy, Matlotlib/Seaborn for Data Visualization, Git/Github, Data structures, Relational Databases, SQL, Object-Oriented Programming, NoSQL databases, MongoDB, JSON, HTML/XML, Accessing Data Through APIs and CSS Web Scraping
• Probability Sampling, AB Testing, Combinatorics, Probability Theory, Statistical Distributions, Bayes Theorem, Naive Bayes Classifier, Sampling Methods, Monte Carlo Simulation, Hypothesis Testing and AB Testing
• Statistical Modeling, Linear Algebra, Linear Regression and extensions, Polynomials, Interaction effects, Logistic regression, Optimization Cost Function, Gradient Descent, Maximum Likelihood Estimation, Time Series Modeling, Regularization and Model Validation
• Python for Data Science, Variables, Booleans and Conditionals, Lists, Dictionaries, Looping, and Functions