Media Big Data Analysis using Spark & AWS EMR, Data Storytelling, Build ETL & Machine Learning Pipeline for NLP tasks, A/B Testing Design & Post-Analysis & Customer Targeting, build time series dashboard using Flask and Plotly, Recommendation System

Ting-DS/Data-Scientist-Nanodegreee-Udacity

Data Scientist Nanodegree - Udacity

Introduction

This repository contains five data science projects, and the insights I gained from them, completed as part of the Data Scientist Nanodegree at Udacity:

Data Science Basic Skills

  • Python, SQL, SAS, R; Data Visualization; Command Line Essentials, Git & GitHub; Practical Statistics, Linear Algebra, Machine Learning

Part 1. Cross-Industry Standard Process for Data Mining (CRISP-DM)

Part 2. Software Engineering

Part 3. Data Engineering

Part 4. Experimental Design (A/B Testing)

Part 5. Recommendation System

Capstone Project. Big Data Analysis with Spark

Licensing, Authors, Acknowledgements

I would like to extend my sincere gratitude to data science teams in

for their contribution in making this valuable resource available to the public. Special thanks go to Udacity for its guidance throughout this project. Feel free to use the contents of this work; when doing so, please credit me and/or Udacity as appropriate.
