Top 50 Awesome List

academic/awesome-datascience

Computer Science  14 days ago  19.6k
📝 An awesome Data Science repository to learn and apply for real world problems.
View byDAY/WEEK/README
View on Github

Sep 16th

Learn Data Science

Deep Learning architectures

  • Transformer
  • Conditional Random Field (CRF)
  • MOOC's

  • Stanford Artificial Intelligence Professional Program
  • Sep 6th

    Bloggers

  • Aditi Rastogi - ML,DL,Data Science blog
  • Youtube Videos & Channels

  • Deep Learning Architectures
  • Sep 1st

    Learn Data Science

  • Book Deals
  • Jul 29th

    MOOC's

  • Recommender Systems Specialization from University of Minnesota is an intermediate/advanced level specialization focused on Recommender System on Coursera platform.
  • Tutorials

  • 1000 Data Science Projects you can run on the browser with ipyton.
  • Jun 27th

    Tutorials

    Free Courses

  • Skillcombo - Data Science - 1000+ free online Data Science courses
  • Jun 11th

    Books

  • How to Lead in Data Science - Early access
  • Jun 9th

    Books

  • Casual Inference for Data Science - Early access
  • May 30th

    COLLEGES

  • MS in Computer Information Systems @ Boston University
  • Bloggers

  • Datawrangling by Peter Skomoroch. MACHINE LEARNING, DATA MINING, AND MORE
  • Data Science 101 - Learning To Be A Data Scientist
  • Data Sets

  • NASDAQ:DATA - Nasdaq Data Link A premier source for financial, economic and alternative datasets.
  • National Centers for Environmental Information
  • Books

  • Dive into Deep Learning
  • May 12th

    Books

  • Julia for Data Analysis - Early access
  • Apr 13th

    Learn Data Science

    Semi-Supervised Learning

  • Clustering
  • COLLEGES

  • Master of Applied Data Science @ The University of Michigan
  • Tutorials

  • Python for Data Science: A Beginner’s Guide
  • Mar 7th

    Data Sets

  • SocialGrep - a collection of open Reddit datasets.
  • Feb 17th

    Books

  • Data Mesh in Action - Early access
  • Feb 15th

    Bloggers

  • Louis Dorard a technology guy with a penchant for the web and for data, big and small
  • Quora Data Science - Data Science Questions and Answers from experts
  • Wes McKinney - Wes McKinney Archives.
  • Greg Reda - Greg Reda Personal Blog
  • Kevin Davenport - Kevin Davenport Personal Blog
  • Julia Evans - Recurse Center alumna
  • Sean J. Taylor - Personal Web Page
  • Drew Conway - Personal Web Page
  • Noah Iliinsky - Personal Blog
  • Matt Harrison - Personal Blog
  • Prash Chan - Tech Blog on Master Data Management And Every Buzz Surrounding It
  • Clare Corthell - The Open Source Data Science Masters
  • Paul Miller Based in the UK and working globally, Cloud of Data's consultancy services help clients understand the implications of taking data and more to the Cloud.
  • Data Science London Data Science London is a non-profit organization dedicated to the free, open, dissemination of data science. We are the largest data science community in Europe. We are more than 3,190 data scientists and data geeks in our community.
  • Machine Learning Mastery about helping professional programmers to confidently apply machine learning algorithms to address complex problems.
  • Daniel Forsyth - Personal Blog
  • Revolution Analytics - Data Science Blog
  • Spenczar a data scientist at Twitch. I handle the whole data pipeline, from tracking to model-building to reporting.
  • KD Nuggets Data Mining, Analytics, Big Data, Data, Science not a blog a portal
  • Meta Brown - Personal Blog
  • New Data Scientist How a Social Scientist Jumps into the World of Big Data
  • Harvard Data Science - Thoughts on Statistical Computing and Visualization
  • Kaggle Past Solutions
  • NYC Taxi Visualization Blog
  • Learning Lover
  • Dataists
  • Data-Mania
  • Data-Magnum
  • P-value - Musings on data science, machine learning and stats.
  • Digital transformation
  • Data Mania Blog - The File Drawer - Chris Said's science blog
  • Emilio Ferrara's web page
  • DataNews
  • Data Stories
  • Meaning of
  • Adventures in Data Land
  • DATA MINERS BLOG
  • FlowingData - Visualization and Statistics
  • Calculated Risk
  • i am trask - A Machine Learning Craftsmanship Blog
  • Dataconomy - A blog on the new emerging data economy
  • Data School - Data science tutorials for beginners!
  • Colah's Blog - Blog for understanding Neural Networks!
  • Distill - Dedicated to clear explanations of machine learning!
  • Books

  • Essential Natural Language Processing - Early access
  • Data Science at Scale with Python and Dask
  • The Data Science Handbook: Advice and Insights from 25 Amazing Data Scientists
  • Everyday Data Science & (cheaper PDF version)
  • Mining Massive Datasets - free e-book comprehended by an online course
  • Global Optimization Algorithms: Theory and Application - Free Download
  • Genetic Algorithms and Evolutionary Computation - Free Download
  • Neural Networks and Deep Learning
  • Artificial Intelligence: Foundations of Computational Agents, 2nd Edition - Free HTML version
  • The Quest for Artificial Intelligence: A History of Ideas and Achievements - Free Download
  • Podcasts

  • Data Science Mixer
  • Data Crunch
  • Adversarial Learning
  • Data Stories
  • Learning Machines 101
  • Linear Digressions
  • Partially Derivative
  • Journals, Publications and Magazines

  • Medium Data Science Topic - Data Science related publications on medium
  • Journal of Data Science - an international journal devoted to applications of statistical methods at large
  • ICML - International Conference on Machine Learning
  • epjdatascience
  • Journal of Big Data
  • Big Data & Society
  • datatau.com/news - Like Hacker News, but for data
  • Visualization Tools - Environments

  • NetworkX
  • Glue
  • addepar
  • anychart
  • cartodb
  • Cube
  • d3plus
  • dygraphs
  • ECharts
  • exhibit
  • highcarts
  • jqplot
  • Matplotlib
  • nvd3
  • Openrefine
  • raw
  • techanjs
  • Timeline
  • variancecharts
  • r2d3
  • Deep Learning

    tensorflow

  • TensorForcestars3.2k
  • Ludwigstars8.5k
  • tensorpackstars6.2k
  • TensorLayerstars7.1k
  • Deep Learning

    pytorch

  • pyrostars7.6k
  • pytorch_geometricstars15.7k
  • skorchstars4.7k
  • PyTounestars531
  • Machine Learning in General Purpose

  • modALstars1.8k
  • sigopt_sklearnstars72
  • scikit-learn
  • Shogun
  • Tutorials

  • Over 1000 Data Science Online Courses at Classpert Online Search Engine
  • MOOC's

  • Coursera Tensorflow in practice
  • Udacity - Deep Learning
  • Oxford Machine Learning
  • Data Mining - 5 Steps Courses, A Specialization on Coursera
  • CS 109 Data Science
  • CS 171 Visualization
  • Oxford Deep Learning
  • UBC Machine Learning - video
  • CS 231 - Convolutional Neural Networks for Visual Recognition
  • Intensive Programs

  • S2DS
  • COLLEGES

  • Master of Data Science @ Illinois Institute of Technology
  • Master of Data Science @ Melbourne University
  • M.S. Management & Data Science @ Leuphana
  • MS in Applied Data Science @ Syracuse
  • Data Science Degree @ UVA
  • Data Science Degree @ Berkeley
  • Data Science Degree @ Wisconsin
  • MS in Business Analytics @ ASU Online
  • Presentations

  • How to Become a Data Scientist
  • Introduction to Data Science
  • Intro to Data Science for Enterprise Big Data
  • How to Interview a Data Scientist
  • The Science of a Great Career in Data Science
  • What Does a Data Scientist Do?
  • Building Data Start-Ups: Fast, Big, and Focused
  • How to win data science competitions with Deep Learning
  • Competitions

  • Analytics Vidhya
  • Data Sets

  • Academic Torrents
  • hadoopilluminated.com
  • United States Census Bureau
  • usgovxml.com
  • enigma.com - Navigate the world of public data - Quickly search and analyze billions of public records published by governments, companies and organizations.
  • Public Big Data Sets
  • A Deep Catalog of Human Genetic Variation
  • Google Public Data
  • World Bank Data
  • NYC Taxi data
  • UC Irvine Machine Learning Repository - contains data sets good for machine learning
  • research-quality data sets by Hilary Mason
  • ClimateData.us (related: U.S. Climate Resilience Toolkit)
  • GHDx - Institute for Health Metrics and Evaluation - a catalog of health and demographic datasets from around the world and including IHME results
  • undata
  • NASA SocioEconomic Data and Applications Center - SEDAC
  • Sweden, Statistics
  • StackExchange Data Explorer - an open source tool for running arbitrary queries against public data from the Stack Exchange network.
  • Open data Index
  • GHTorrent
  • Feb 3rd

    Journals, Publications and Magazines

  • all AI news - The AI/ML/Big Data news aggregator platform
  • Jan 9th

    Machine Learning in General Purpose

  • Deepchecksstars2.1k
  • Jan 1st

    Visualization Tools - Environments

  • Netronstars20.1k
  • Dec 12th, 2021

    Nov 23rd, 2021

    Books

  • Graph Algorithms for Data Science - Early access
  • Oct 29th, 2021

    Oct 28th, 2021

    Oct 24th, 2021

    Bloggers

  • Maria Khalusova - Data science blog
  • Data Sets

  • MapLight - provides a variety of data free of charge for uses that are freely available to the general public. Click on a data set below to learn more
  • Podcasts

  • O'Reilly Data Show Podcast
  • Oct 23rd, 2021

    Youtube Videos & Channels

  • Neural networks from scratch by Sentdex
  • Oct 21st, 2021

    Newsletters

  • The Analytics Engineering Roundup. A newsletter about data science. Archive.
  • Oct 19th, 2021

    Visualization Tools - Environments

  • vizzustars1.6k
  • Oct 18th, 2021

    Learn Data Science

    Unsupervised Learning

  • Hidden Markov Models (HMM)
  • Books

  • Introduction to Machine Learning with Python
  • Oct 17th, 2021

    Oct 11th, 2021

    Oct 9th, 2021

    Learn Data Science

    Deep Learning architectures

  • Self-Organized Maps
  • Oct 1st, 2021

    Books

  • Streaming Systems: The What, Where, When, and How of Large-Scale Data Processing
  • Learn Data Science

    Unsupervised Learning

  • Dimension Reduction
    • Principal Component Analysis (PCA)
    • t-SNE
  • Learn Data Science

    Semi-Supervised Learning

  • S3VM
  • Generative models
  • Low-density separation
  • Laplacian regularization
  • Heuristic approaches
  • Learn Data Science

    Deep Learning architectures

  • Multilayer Perceptron
  • Convolutional Neural Network (CNN)
  • Recurrent Neural Network (RNN)
  • Boltzmann Machines
  • Autoencoder
  • Generative Adversarial Network (GAN)
  • Aug 14th, 2021

    Aug 3rd, 2021

    Bloggers

  • Loic Tetrel - Data science blog
  • Jun 16th, 2021

    Visualization Tools - Environments

  • altair
  • May 17th, 2021

    Apr 29th, 2021

    Mar 25th, 2021

    Tutorials

  • Realtime deployment Tutorial on Python time-series model deployment.
  • Competitions

  • Microprediction
  • Mar 20th, 2021

    Deep Learning

    pytorch

  • pytorch_tabularstars736
  • Bloggers

  • Deep and Shallow - All things Deep and Shallow in Data Science
  • Feb 24th, 2021

    Bloggers

  • nbshare - Data Science notebooks
  • Feb 8th, 2021

    Feb 3rd, 2021

    Dec 23rd, 2020

    Presentations

  • Full-Stack Data Scientist
  • Books

  • R for Data Science
  • Build a Career in Data Science
  • Machine Learning Bookcamp - Early access
  • Newsletters

  • AI Digest. A weekly newsletter to keep up to date with AI, machine learning, and data science. Archive.
  • DataTalks.Club. A weekly newsletter about data-related things. Archive.
  • Youtube Videos & Channels

  • DataTalks.Club
  • Slack Communities

  • DataTalks.Club
  • Data Sets

  • 5000 Images of Clothesstars63
  • Other Lists

  • Data Science Interviews Questionsstars6.6k
  • Nov 30th, 2020

    Hobby

  • Awesome Music Productionstars531
  • Tutorials

  • Tutorials to get started on signal processings for machine learningstars17
  • Bloggers

  • Jingles - Review and extract key concepts from academic papers
  • Nov 14th, 2020

    Tutorials

  • #tidytuesdaystars5k A weekly data project aimed at the R ecosystem.
  • Nov 11th, 2020

    Oct 31st, 2020

    Visualization Tools - Environments

  • ggplot2
  • C3
  • TensorWatchstars3.2k
  • bokeh
  • MOOC's

  • Linear Algebra - Linear Algebra course by Gilbert Strang
  • A 2020 Vision of Linear Algebra (G. Strang)
  • Tutorials

    Free Courses

  • AI Expert Roadmapstars21.5k - Roadmap to becoming an Artificial Intelligence Expert
  • Convex Optimization - Convex Optimization (basics of convex analysis; least-squares, linear and quadratic programs, semidefinite programming, minimax, extremal volume, and other problems; optimality conditions, duality theory...)
  • Books

  • Convex Optimization - Convex Optimization book by Stephen Boyd - Free Download
  • Oct 29th, 2020

    Oct 25th, 2020

    Learn Data Science

  • Algorithms
  • Colleges
  • MOOC's
  • Podcasts
  • Books
  • YouTube Videos & Channels
  • Toolboxes - Environment
  • Journals, Publications and Magazines
  • Presentations
  • Tutorials
  • Learn Data Science

    Supervised Learning

  • Linear Regression
  • Ordinary Least Squares
  • Logistic Regression
  • Stepwise Regression
  • Multivariate Adaptive Regression Splines
  • Locally Estimated Scatterplot Smoothing
  • Boosting
  • Bagging
  • Random Forest
  • AdaBoost
  • Learn Data Science

    Unsupervised Learning

  • Self-organizing map
  • Adaptive resonance theory
  • Learn Data Science

    Data Mining Algorithms

  • C4.5
  • k-Means
  • SVM
  • Apriori
  • EM
  • PageRank
  • AdaBoost
  • kNN
  • Naive Bayes
  • CART
  • COLLEGES

  • A list of colleges and universities offering degrees in data science.stars149
  • Msc in Data Science @ The University of Edinburgh
  • Master of Management Analytics @ Queen's University
  • MOOC's

  • Coursera Introduction to Data Science
  • Data Science - 9 Steps Courses, A Specialization on Coursera
  • Machine Learning – 5 Steps Courses, A Specialization on Coursera
  • OpenIntro
  • Process Mining: Data science in Action
  • Oxford Deep Learning - video
  • Data Science Specializationstars3.9k
  • Coursera Big Data Specialization
  • Statistical Thinking for Data Science and Analytics by Edx
  • Cognitive Class AI by IBM
  • Keras in Motion
  • Microsoft Professional Program for Data Science
  • COMP3222/COMP6246 - Machine Learning Technologies
  • Coursera Deep Learning Specialization
  • 365 Data Science Course
  • Coursera Natural Language Processing Specialization
  • Coursera GAN Specialization
  • Codecademy's Data Science
  • Tutorials

  • Data science your waystars562
  • PySpark Cheatsheetstars209
  • Machine Learning, Data Science and Deep Learning with Python
  • How To Label Data
  • Your Guide to Latent Dirichlet Allocation
  • Tutorials of source code from the book Genetic Algorithms with Python by Clinton Sheppardstars1k
  • Tutorials

    Free Courses

  • Data Scientist with R
  • Data Scientist with Python
  • Genetic Algorithms OCW Course
  • Visualization Tools - Environments

  • amcharts
  • slemma
  • Data-Driven Documents(D3js)
  • gephi
  • Google Chart Gallery
  • import.io
  • plot.ly
  • Seaborn
  • vida
  • Wrangler
  • Redash
  • Journals, Publications and Magazines

  • GECCO - The Genetic and Evolutionary Computation Conference (GECCO)
  • Big Data Research
  • Data Science Journal
  • Data Science Trello Board
  • Towards Data Science Genetic Algorithm Topic -Genetic Algorithm related Publications onTowards Data Science
  • Presentations

  • How to Share Data with a Statisticianstars6.2k
  • Books

  • Classic Computer Science Problems in Python
  • Evolutionary Algorithms - Free Download
  • Advances in Genetic Programming, Vol. 3 - Free Download
  • Bloggers

  • Analytics Vidhya - A full-fledged website about data science and analytics study material.
  • Chris Albon's Website - Data Science and AI notes
  • Youtube Videos & Channels

  • What is machine learning?
  • Andrew Ng: Deep Learning, Self-Taught Learning and Unsupervised Feature Learning
  • Deep Learning: Intelligence from Big Data
  • Interview with Google's AI and Deep Learning 'Godfather' Geoffrey Hinton
  • Introduction to Deep Learning with Python
  • What is machine learning, and how does it work?
  • Data School - Data Science Education
  • Neural Nets for Newbies by Melanie Warrick (May 2015)
  • Neural Networks video series by Hugo Larochelle
  • Google DeepMind co-founder Shane Legg - Machine Super Intelligence
  • Data Science Primer
  • Data Science with Genetic Algorithms
  • Data Science for Beginners
  • Competitions

  • Kaggle
  • DrivenData
  • InnoCentive
  • Data Sets

  • data.gov - The home of the U.S. Government's open data
  • datahub.io
  • aws.amazon.com/datasets
  • figshare.com
  • Quora's Big Datasets Answer
  • Kaggle Datasets
  • A community-curated database of well-known people, places, and things
  • Open Data Philly Connecting people with data for Philadelphia
  • grouplens.org Sample movie (with ratings), book and wiki datasets
  • r/datasets
  • St. Louis Federal Reserve Economic Data - FRED
  • New Zealand Institute of Economic Research – Data1850
  • Open Data Sourcesstars460
  • UNICEF Data
  • Public Git Archive
  • Microsoft Research Open Data
  • Open Government Data Platform India
  • NAYN.CO Turkish News with categoriesstars3
  • Covid-19stars1.1k
  • Covid-19 Googlestars112
  • Enron Email Dataset
  • Other Lists

  • Other amazingly awesome lists can be found in the awesome-awesomenessstars29.4k
  • Awesome Machine Learningstars56.1k
  • listsstars8.4k
  • awesome-pythonstars143.2k
  • Data Science IPython Notebooks.stars23.8k
  • awesome-rstars5.2k
  • awesome-Machine Learning & Deep Learning Tutorials
  • Machine Learning for Software Engineersstars26.2k
  • Community Curated Data Science Resources
  • Awesome Machine Learning On Source Codestars5.6k
  • Awesome Community Detectionstars2k
  • Awesome Graph Classificationstars4.5k
  • Awesome Decision Tree Papersstars2k
  • Awesome Fraud Detection Papersstars1.2k
  • Awesome Gradient Boosting Papersstars810
  • Awesome Computer Vision Modelsstars403
  • Awesome Monte Carlo Tree Searchstars483
  • Glossary of common statistics and ML terms
  • 100 NLP Papersstars3.4k
  • Oct 17th, 2020

    Podcasts

  • Data Skeptic
  • Superdatascience
  • What's The Point
  • Books

  • Machine Learning from Scratch
  • Fighting Churn With Data
  • Python Data Science Handbook
  • Think Like a Data Scientist
  • Introducing Data Science
  • Practical Data Science with R
  • Exploring Data Science - free eBook sampler
  • Exploring the Data Jungle - free eBook sampler
  • Math for Programmers Early access
  • R in Action, Third Edition Early access
  • Data Science Bookcamp Early access
  • Data Science Thinking: The Next Scientific, Technological and Economic Revolution
  • Applied Data Science: Lessons Learned for the Data-Driven Business
  • The Data Science Handbook
  • Pandas in Action - Early access
  • Genetic Algorithms and Genetic Programming
  • Advances in Evolutionary Algorithms - Free Download
  • Genetic Programming: New Approaches and Successful Applications - Free Download
  • Learn Data Science

    Supervised Learning

  • Regression
  • Classification
    • k-nearest neighbor
    • Support Vector Machines
    • Decision Trees
    • ID3 algorithm
    • C4.5 algorithm
  • Ensemble Learning
  • Learn Data Science

    Unsupervised Learning

  • Clustering
    • Hierchical clustering
    • k-means
    • Fuzzy clustering
    • Mixture models
  • Neural Networks
  • Learn Data Science

    Reinforcement Learning

  • Q Learning
  • SARSA (State-Action-Reward-State-Action) algorithm
  • Temporal difference learning
  • Github Groups

  • Berkeley Institute for Data Science
  • Jul 19th, 2020

    Bloggers

  • floydhub - Blog for Evolutionary Algorithms
  • Dec 7th, 2019

    Bloggers

  • Andrew Carr - Data Science with Esoteric programming languages
  • Mar 10th, 2019

    Telegram Channels

  • Open Data Science – First Telegram Data Science channel. Covering all technical and popular staff about anything related to Data Science: AI, Big Data, Machine Learning, Statistics, general Math and the applications of former.
  • Loss function porn — Beautiful posts on DS/ML theme with video or graphic vizualization.
  • Machinelearning – Daily ML news.
  • Feb 11th, 2017

    Bloggers

  • Matthew Russell - Mining The Social Web.
  • Hakan Kardas - Personal Web Page
  • Hilary Mason - Personal Web Page
  • Vamshi Ambati - AllThings Data Sciene
  • Siah a PhD student at Berkeley
  • Data Science Weekly - Weekly News Blog
  • R Bloggers - R Bloggers
  • The Practical Quant Big data
  • WhatSTheBigData is some of, all of, or much more than the above and this blog explores its impact on information technology, the business world, government agencies, and our lives.
  • DataScientistJourney
  • datascopeanalytics
  • datascientistjourney
  • Reddit TextMining
  • Hilary Parker
  • Data Science Lab
  • Dataclysm
  • Dominodatalab
  • Vademecum of Practical Data Science - Handbook and recipes for data-driven solutions of real-world problems
  • Occam's Razor - Focused on Web Analytics.
  • Jun 7th, 2016

    Facebook Accounts

  • The Data Science Blog
  • Jan 19th, 2016

    Facebook Accounts

  • Veri Bilimi Istanbul
  • Nov 29th, 2015

    Last Checked At: 2022-09-30T06:12:36.346Z
    Previous
    prakhar1989/awesome-courses
    Next
    siboehm/awesome-learn-datascience