Skip to content

This repository gathers some analysis made on various datasets encountered along my learning.

Notifications You must be signed in to change notification settings

cnoza/DataAnalysis

Repository files navigation

Data Science Portfolio

Welcome to this repository!

About me

I hold a M.S. in Physics and an Advanced M.S.c. in Financial Risk Management. I'm learning about Data Science techniques using Python and R (and I'm fascinated by it!)

Courses

I'm learning Data Science techniques mainly from online ressources such as blog posts, learning platforms such as Coursera, Dataquest.io, Datacamp and others (Kaggle, DataWorld, etc.). The training exercises are gathered in the Courses folder for reference.

Projects

Are also gathered here some projects applying the ML techniques learned throughout the courses. Here is a summarized list of techniques I had the opportunity to apply on various datasets:

  • Data acquisition: using API's, Web Scraping, SQLite, PostgreSQL
  • Statistics: Hypothesis testing, Regularized Linear & Logistic Regression (Lasso, Ridge, ElasticNet)
  • Supervised Machine Learning: Decision Trees, kNN, Random Forests, SVM
  • Unsupervised Machine Learning: K Means & Hierarchical Clustering, Dimension reduction (PCA, TruncateSVD, NMF)
  • Deep learning/Neural Networks: Forward/Backward prop, ReLu, Gradient descent, Keras, optimization methods
  • Natural Language Processing: Text tokenization using NLTK
  • Network analysis with NetworkX

See the Projects for more detailed information.

Installation

I'm using the Anaconda Python distribution and Jupyter Notebooks to present the projects. If you wish to run them on your own station, here is the yml file. Instructions on how to create an virtual environment and install it can be found here. For the most useful Conda commands, see here.

Data visualisation

I'm also enjoying learning how to build interesting data visualisations using Tableau. Click here to access my profile page on Tableau public.

Certification

To account for the various trainings followed, I'm gathering all the certificates earned through hard labor in the Certification folder.

About

This repository gathers some analysis made on various datasets encountered along my learning.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published