ChristosChristofidis/awesome-deep-learning

Computer Science  2 months ago  18.9k
A curated list of awesome Deep Learning tutorials, projects and communities.

May 2nd - May 8th, 2022

Books

  • Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow by Aurélien Géron | Oct 15, 2019

Courses

  • Deep Learning A.I.Shelf

Papers

  • Batch Normalization: Accelerating Deep Network Training by Reducing Internal Covariate Shift
  • DeepFaceDrawing: Deep Generation of Face Images from Sketches

Researchers

Datasets

  • ArtEmis - Contains 450K affective annotations of emotional responses and linguistic explanations for 80,000 artworks of WikiArt.
  • SQuAD - Stanford released ~100,000 English QA pairs and ~50,000 unanswerable questions
  • FQuAD - ~25,000 French QA pairs released by Illuin Technology
  • GermanQuAD and GermanDPR - deepset released ~14,000 German QA pairs
  • SberQuAD (1 star) - Sberbank released ~90,000 Russian QA pairs
  • SANAD: Single-Label Arabic News Articles Dataset for Automatic Text Categorization - SANAD Dataset is a large collection of Arabic news articles that can be used in different Arabic NLP tasks such as Text Classification and Word Embedding. The articles were collected using Python scripts written specifically for three popular news websites: AlKhaleej, AlArabiya and Akhbarona.
  • Referit3D - Two large-scale and complementary visio-linguistic datasets (aka Nr3D and Sr3D) for identifying fine-grained 3D objects in ScanNet scenes. Nr3D contains 41.5K natural, free-form utterances, and Sr3D contains 83.5K template-based utterances.

Researchers

Frameworks

  • InsNet - A neural network library for building instance-dependent NLP models with padding-free dynamic batching (58 stars)
  • Maze (209 stars) - Application-oriented deep reinforcement learning framework addressing real-world decision problems.
  • haystack: an open-source neural search framework

Apr 25th - May 1st, 2022

Researchers

Tools

  • Nebullvm (1.2k stars) - Easy-to-use library to boost deep learning inference leveraging multiple deep learning compilers.
  • Netron (19.1k stars) - Visualizer for deep learning and machine learning models

Mar 7th - Mar 13th, 2022

Books

  • Engineering Deep Learning Platforms - by Chi Wang and Donald Szeto

Jan 31st - Feb 6th, 2022

Researchers

Websites

  • all AI news

Nov 1st - Nov 7th, 2021

Oct 25th - Oct 31st, 2021

Books

  • Deep Learning Patterns and Practices - by Andrew Ferlitsch
  • Inside Deep Learning - by Edward Raff
  • Deep Learning with Python, Second Edition - by François Chollet

Sep 20th - Sep 26th, 2021

Researchers

Datasets

  • LLVIP (261 stars) - 15488 visible-infrared paired images (30976 images) for low-light vision research, Project_Page
  • MSDA (175 stars) - Over 5 million images from 5 different domains for multi-source OCR/text recognition DA research, Project_Page

Sep 13th - Sep 19th, 2021

Books

  • Deep Learning for Natural Language Processing - by Stephan Raaijmakers

Aug 30th - Sep 5th, 2021

Researchers

Tools

  • Neptune - Lightweight tool for experiment tracking and results visualization.

Aug 2nd - Aug 8th, 2021

Researchers

Tools

  • Visual Studio Tools for AI - Develop, debug and deploy deep learning and AI solutions

Mar 29th - Apr 4th, 2021

Researchers

Tools

  • hub (4.6k stars) - Fastest unstructured dataset management for TensorFlow/PyTorch by activeloop.ai. Stream & version-control data. Converts large data into single numpy-like array on the cloud, accessible on any machine.

Feb 1st - Feb 7th, 2021

Nov 23rd - Nov 29th, 2020

Courses

  • Intro to Deep Learning with PyTorch - A great introductory course on Deep Learning by Udacity and Facebook AI
  • Deep Learning by Kaggle - Kaggle's free course on Deep Learning

Researchers

Tools

  • DAGsHub - Community platform for Open Source ML – Manage experiments, data & models and create collaborative ML projects easily.

Oct 26th - Nov 1st, 2020

Researchers

Miscellaneous

  • AI Expert Roadmap (20.5k stars) - Roadmap to becoming an Artificial Intelligence Expert

Oct 12th - Oct 18th, 2020

Books

  • TensorFlow 2.0 in Action - by Thushan Ganegedara

Sep 28th - Oct 4th, 2020

Courses

  • AWS Machine Learning - Machine Learning and Deep Learning courses from Amazon's Machine Learning University

Researchers

  • Andrej Karpathy

Sep 21st - Sep 27th, 2020

Researchers

Tools

  • Determined (1.7k stars) - Deep learning training platform with integrated support for distributed training, hyperparameter tuning, smart GPU scheduling, experiment tracking, and a model registry.

Researchers

Websites

  • ahmedbesbes.com
  • CatalyzeX: Machine Learning Hub for Builders and Makers

Videos and Lectures

  • Machine Learning CS 229 : End part focuses on deep learning By Andrew Ng

Books

  • Math and Architectures of Deep Learning - by Krishnendu Chaudhury

Aug 3rd - Aug 9th, 2020

Researchers

Tools

  • CatalyzeX - Browser extension (Chrome and Firefox) that automatically finds and links to code implementations for ML papers anywhere online: Google, Twitter, Arxiv, Scholar, etc.

Jul 20th - Jul 26th, 2020

Videos and Lectures

  • Medical Imaging with Deep Learning Tutorial: This tutorial is styled as a graduate lecture about medical imaging with deep learning. This will cover the background of popular medical image domains (chest X-ray and histology) as well as methods to tackle multi-modality/view, segmentation, and counting tasks.

Researchers

Websites

  • AI Hub - supported by AAAI, NeurIPS

Jun 29th - Jul 5th, 2020

Videos and Lectures

  • CMU 11-785 Intro to Deep learning Spring 2020 Course: 11-785, Intro to Deep Learning by Bhiksha Raj
  • Deepmind x UCL Deeplearning: 2020 version
  • Deepmind x UCL Reinforcement Learning: Deep Reinforcement Learning

May 25th - May 31st, 2020

Researchers

Datasets

  • MIT Vision Texture - Image archive (100+ images) (Formats: ppm)

Researchers

Miscellaneous

  • CNN Explainer

Mar 16th - Mar 22nd, 2020

Researchers

Websites

  • AI Summer

Mar 2nd - Mar 8th, 2020

Tutorials

  • The Illustrated Self-Supervised Learning
  • Visual Paper Summary: ALBERT (A Lite BERT)

Researchers

Websites

  • amitness.com

Researchers

Miscellaneous

  • toolbox: Curated list of ML libraries

Researchers

Datasets

  • Biometric Systems Lab - University of Bologna

Feb 17th - Feb 23rd, 2020

Jan 27th - Feb 2nd, 2020

Tutorials

  • Hardware for AI: Understanding computer hardware & build your own computer (2 stars)
  • Programming Community Curated Resources

Courses

  • Face Detection with Computer Vision and Deep Learning by Hakan Cebeci
  • Grokking Deep Learning in Motion by Beau Carnes (2018)
  • Deep Reinforcement Learning (nanodegree) - Udacity, a 3-6 month nanodegree spanning multiple courses (2018)
  • Deep Learning from the Foundations by Jeremy Howard - Fast.ai
  • Machine Learning for Mere Mortals video course by Nick Chase
  • Machine Learning Crash Course with TensorFlow APIs - Google AI

Books

  • Dive into Deep Learning - numpy based interactive Deep Learning book

Videos and Lectures

  • Deep Learning with R in Motion: a live video course that teaches how to apply deep learning to text and images using the powerful Keras library and its R language interface.

Researchers

Websites

  • A Beginner's Guide To Understanding Convolutional Neural Networks
  • Machine Learning Mastery blog
  • ML Compiled
  • Programming Community Curated Resources

Researchers

Miscellaneous

  • Ladder Network (98 stars) - Keras Implementation of Ladder Network for Semi-Supervised Learning
  • The Unreasonable Effectiveness of Recurrent Neural Networks - Andrej Karpathy blog post about using RNN for generating text.
  • Microsoft Recommenders (13.4k stars) - Contains examples, utilities and best practices for building recommendation systems. Implementations of several state-of-the-art algorithms are provided for self-study and customization in your own applications.

Researchers

Frameworks

  • TensorForce - A TensorFlow library for applied reinforcement learning (3.1k stars)
  • Synapses - A lightweight library for neural networks that runs anywhere (61 stars)
  • Catalyst: High-level utils for PyTorch DL & RL research. It was developed with a focus on reproducibility, fast experimentation and code/ideas reusing (2.9k stars)
  • garage - A toolkit for reproducible reinforcement learning research (1.5k stars)
  • Detecto - Train and run object detection models with 5-10 lines of code (555 stars)
  • Karate Club - An unsupervised machine learning library for graph structured data (1.6k stars)

Researchers

Tools

  • ML Workspace (2.6k stars) - All-in-one web-based IDE for machine learning and data science.
  • dowel (25 stars) - A little logger for machine learning research. Log any object to the console, CSVs, TensorBoard, text log files, and more with just one call to logger.log()

Jan 20th - Jan 26th, 2020

Courses

  • Deep Learning - UC Berkeley | STAT-157 by Alex Smola and Mu Li (2019)

Dec 9th - Dec 15th, 2019

Researchers

Datasets

  • National Design Repository - Over 55,000 3D CAD and solid models of (mostly) mechanical/machined engineering designs. (Formats: gif,vrml,wrl,stp,sat)

Oct 28th - Nov 3rd, 2019

Courses

  • Deep Learning Specialization - Coursera - Breaking into AI with the best course from Andrew Ng.

Aug 26th - Sep 1st, 2019

Jul 22nd - Jul 28th, 2019

Researchers

Miscellaneous

  • YOLO: Practical Implementation using Python
  • AlphaGo - A replication of DeepMind's 2016 Nature publication, "Mastering the game of Go with deep neural networks and tree search"
  • Machine Learning for Software Engineers (26k stars)
  • Machine Learning is Fun!
  • Siraj Raval's Deep Learning tutorials
  • Dockerface (179 stars) - Easy to install and use deep learning Faster R-CNN face detection for images and video in a docker container.
  • Awesome Deep Learning Music (2.3k stars) - Curated list of articles related to deep learning scientific research applied to music
  • Awesome Graph Embedding (4.4k stars) - Curated list of articles related to deep learning scientific research on graph structured data at the graph level.
  • Awesome Network Embedding (2.4k stars) - Curated list of articles related to deep learning scientific research on graph structured data at the node level.

Jul 1st - Jul 7th, 2019

Courses

  • MIT Intro to Deep Learning 7 day bootcamp - A seven-day bootcamp designed at MIT to introduce deep learning methods and applications (2019)
  • Deep Blueberry: Deep Learning - A free five-weekend plan for self-learners to learn the basics of deep-learning architectures like CNNs, LSTMs, RNNs, VAEs, GANs, DQN, A3C and more (2019)
  • Spinning Up in Deep Reinforcement Learning - A free deep reinforcement learning course by OpenAI (2019)

Jun 3rd - Jun 9th, 2019

Researchers

Tools

  • TensorWatch (3.2k stars) - Debugging and visualization for deep learning

Mar 4th - Mar 10th, 2019

Courses

  • AI for Everyone by Andrew Ng (2019)

Feb 25th - Mar 3rd, 2019

Feb 18th - Feb 24th, 2019

Nov 26th - Dec 2nd, 2018

Videos and Lectures

  • Deep Learning Crash Course By Oliver Zeigermann

Sep 24th - Sep 30th, 2018

Researchers

Tools

  • Jupyter Notebook - Web-based notebook environment for interactive computing
  • TensorBoard (5.9k stars) - TensorFlow's Visualization Toolkit

Researchers

Frameworks

  • albumentations - A fast and framework agnostic image augmentation library (10.4k stars)

Researchers

Miscellaneous

  • Torch7 Cheat sheet

Mar 19th - Mar 25th, 2018

Mar 5th - Mar 11th, 2018

Books

  • Deep Learning by Microsoft Research (2013)

Videos and Lectures

  • Making Sense of the World with Deep Learning By Adam Coates
  • Demystifying Unsupervised Feature Learning By Adam Coates
  • Deep Learning: Intelligence from Big Data by Steve Jurvetson (and panel) at VLAB in Stanford.

Papers

  • Image-to-Image Translation with Conditional Adversarial Networks
  • Berkeley AI Research (BAIR) Laboratory

Researchers

Datasets

  • Content-based image retrieval database - 11 sets of color images for testing algorithms for content-based retrieval. Most sets have a description file with names of objects in each image. (Formats: jpg)
  • INRIA's Syntim stereo databases - 34 calibrated color stereo pairs (Formats: gif)
  • Image Analysis Laboratory
  • JAFFE Facial Expression Image Database - The JAFFE database consists of 213 images of Japanese female subjects posing 6 basic facial expressions as well as a neutral pose. Ratings on emotion adjectives are also available, free of charge, for research purposes. (Formats: TIFF Grayscale images.)
  • JISCT Stereo Evaluation - 44 image pairs. These data have been used in an evaluation of stereo analysis, as described in the April 1993 ARPA Image Understanding Workshop paper "The JISCT Stereo Evaluation" by R.C. Bolles, H.H. Baker, and M.J. Hannah, 263-274 (Formats: SSI)
  • MIT face images and more - hundreds of images (Formats: homebrew)
  • NIST Fingerprint data - compressed multipart uuencoded tar file
  • Geometric & Intelligent Computing Laboratory
  • OSU/SAMPL Database: Range Images, 3D Models, Stills, Motion Sequences - Over 1000 range images, 3D object models, still images and motion sequences (Formats: gif, ppm, vrml, homebrew)
  • Vision Research Group
  • ftp://ftp.limsi.fr/pub/quenot/opflow/testdata/piv/ - Real and synthetic image sequences used for testing a Particle Image Velocimetry application. These images may be used for the test of optical flow and image matching algorithms. (Formats: pgm (raw))
  • LIMSI-CNRS/CHM/IMM/vision
  • Photometric 3D Surface Texture Database - This is the first 3D texture database which provides both full real surface rotations and registered photometric stereo data (30 textures, 1680 images). (Formats: TIFF)
  • Department Image Understanding
  • Centre for Vision, Speech and Signal Processing
  • IAKS/KOGS
  • U Oulu wood and knots database - Includes classifications - 1000+ color images (Formats: ppm)
  • UCID - an Uncompressed Colour Image Database - a benchmark database for image retrieval with predefined ground truth. (Formats: tiff)
  • Yale Face Database B - 5760 single light source images of 10 subjects each seen under 576 viewing conditions (9 poses x 64 illumination conditions). (Formats: PGM)
  • Fashion-MNIST (10.1k stars) - MNIST like fashion product dataset consisting of a training set of 60,000 examples and a test set of 10,000 examples. Each example is a 28x28 grayscale image, associated with a label from 10 classes.

Feb 12th - Feb 18th, 2018

Researchers

Datasets

  • FakeNewsCorpus (327 stars) - Contains about 10 million news articles classified using opensources.co types

Feb 5th - Feb 11th, 2018

Jan 22nd - Jan 28th, 2018

Videos and Lectures

  • Deep Learning Crash Course: a series of mini-lectures by Leo Isikdogan on YouTube (2018)

Oct 9th - Oct 15th, 2017

Sep 4th - Sep 10th, 2017

Researchers

Datasets

  • Visual Object Classes Challenge 2012 (VOC2012) - VOC2012 dataset containing 12k images with 20 annotated classes for object detection and segmentation.
  • Large-scale Fashion (DeepFashion) Database - Contains over 800,000 diverse fashion images. Each image in this dataset is labeled with 50 categories, 1,000 descriptive attributes, bounding box and clothing landmarks

Aug 14th - Aug 20th, 2017

Jun 19th - Jun 25th, 2017

Apr 3rd - Apr 9th, 2017

Researchers

Datasets

  • Digital Embryos - Digital embryos are novel objects which may be used to develop and test object recognition systems. They have an organic appearance. (Formats: various formats are available on request)

Mar 6th - Mar 12th, 2017

Videos and Lectures

  • NIPS 2016 lecture and workshop videos - NIPS 2016

Feb 20th - Feb 26th, 2017

Researchers

Datasets

  • The AR Face Database - Contains over 4,000 color images corresponding to 126 people's faces (70 men and 56 women). Frontal views with variations in facial expressions, illumination, and occlusions. (Formats: RAW (RGB 24-bit))

Dec 5th - Dec 11th, 2016

Oct 24th - Oct 30th, 2016

Researchers

Datasets

  • Open Images dataset (4k stars) - Open Images is a dataset of ~9 million URLs to images that have been annotated with labels spanning over 6000 categories.
  • YouTube-8M Dataset - YouTube-8M is a large-scale labeled video dataset that consists of 8 million YouTube video IDs and associated labels from a diverse vocabulary of 4800 visual entities.

Aug 29th - Sep 4th, 2016

Aug 22nd - Aug 28th, 2016

Videos and Lectures

  • Introduction to Artificial Neural Networks and Deep Learning by Leo Isikdogan at Motorola Mobility HQ

Aug 15th - Aug 21st, 2016

Courses

  • Deep Learning Course by Yann LeCun (2016)

Aug 8th - Aug 14th, 2016

Researchers

Frameworks

  • Knet.jl (1.4k stars)

Researchers

Datasets

  • CIFAR-10 and CIFAR-100

Jul 4th - Jul 10th, 2016

Researchers

Datasets

  • UMass Vision Image Archive - Large image database with aerial, space, stereo, medical images and more. (Formats: homebrew)

Jun 20th - Jun 26th, 2016

May 23rd - May 29th, 2016

May 16th - May 22nd, 2016

Researchers

Datasets

  • DeepMind QA Corpus (1.3k stars) - Textual QA corpus from CNN and DailyMail. More than 300K documents in total. Paper for reference.

Researchers

  • Ludovic Arnold
  • Patrick Nguyen
  • Rob Fergus
  • Tomáš Mikolov
  • Yotaro Kubo
  • Adam Coates
  • David Reichert
  • Justin A. Blanco

May 9th - May 15th, 2016

Courses

  • Deep Learning - UWaterloo by Prof. Ali Ghodsi at University of Waterloo (2015)
  • Deep Learning - Udacity/Google by Vincent Vanhoucke and Arpan Chakraborty (2016)

Researchers

Frameworks

  • SyntaxNet - Google's syntactic parser - A TensorFlow dependency library
  • DSSTNE - Amazon's library for building Deep Learning models (4.4k stars)

Apr 25th - May 1st, 2016

Jan 11th - Jan 17th, 2016

Oct 26th - Nov 1st, 2015

Oct 12th - Oct 18th, 2015

Sep 28th - Oct 4th, 2015

Aug 3rd - Aug 9th, 2015

Researchers

Miscellaneous

  • A recurrent neural network designed to generate classical music. (1.9k stars)

Researchers

Datasets

  • ICG Testhouse sequence - 2 turntable sequences from different viewing heights, 36 images each, resolution 1000x750, color (Formats: PPM)
  • Amsterdam Library of Object Images - ALOI is a color image collection of one-thousand small objects, recorded for scientific purposes. In order to capture the sensory variation in object recordings, we systematically varied viewing angle, illumination angle, and illumination color for each object, and additionally captured wide-baseline stereo images. We recorded over a hundred images of each object, yielding a total of 110,250 images for the collection. (Formats: png)
  • AT&T Laboratories Cambridge face database
  • AVHRR Pathfinder
  • Air Freight - The Air Freight data set is a ray-traced image sequence along with ground truth segmentation based on textural characteristics. (455 images + GT, each 160x120 pixels). (Formats: PNG)
  • Annotated face, hand, cardiac & meat images - Most images & annotations are supplemented by various ASM/AAM analyses using the AAM-API. (Formats: bmp,asf)
  • Image Analysis and Computer Graphics
  • Brown University Stimuli - A variety of datasets including geons, objects, and "greebles". Good for testing recognition algorithms. (Formats: pict)
  • CAVIAR video sequences of mall and public space behavior - 90K video frames in 90 sequences of various human activities, with XML ground truth of detection and behavior classification (Formats: MPEG2 & JPEG)
  • Machine Vision Unit
  • CCITT Fax standard images - 8 images (Formats: gif)
  • CMU CIL's Stereo Data with Ground Truth - 3 sets of 11 images, including color tiff images with spectroradiometry (Formats: gif, tiff)
  • CMU PIE Database - A database of 41,368 face images of 68 people captured under 13 poses, 43 illumination conditions, and with 4 different expressions.
  • CMU VASC Image Database - Images, sequences, stereo pairs (thousands of images) (Formats: Sun Rasterimage)
  • Caltech Image Database - about 20 images - mostly top-down views of small objects and toys. (Formats: GIF)
  • Columbia-Utrecht Reflectance and Texture Database - Texture and reflectance measurements for over 60 samples of 3D texture, observed with over 200 different combinations of viewing and illumination directions. (Formats: bmp)
  • Computational Colour Constancy Data - A dataset oriented towards computational color constancy, but useful for computer vision in general. It includes synthetic data, camera sensor data, and over 700 images. (Formats: tiff)
  • Computational Vision Lab
  • Efficient Content-based Retrieval Group
  • Densely Sampled View Spheres - Upper half of the view sphere of two toy objects with 2500 images each. (Formats: tiff)
  • Computer Science VII (Graphical Systems)
  • El Salvador Atlas of Gastrointestinal VideoEndoscopy - High-resolution images and videos of studies taken from gastrointestinal video endoscopy. (Formats: jpg, mpg, gif)
  • FG-NET Facial Aging Database - Database contains 1002 face images showing subjects at different ages. (Formats: jpg)
  • FVC2000 Fingerprint Databases - FVC2000 is the First International Competition for Fingerprint Verification Algorithms. Four fingerprint databases constitute the FVC2000 benchmark (3520 fingerprints in all).
  • Face and Gesture images and image sequences - Several image datasets of faces and gestures that are ground truth annotated for benchmarking
  • German Fingerspelling Database - The database contains 35 gestures and consists of 1400 image sequences that contain gestures of 20 different persons recorded under non-uniform daylight lighting conditions. (Formats: mpg,jpg)
  • Language Processing and Pattern Recognition
  • Groningen Natural Image Database - 4000+ 1536x1024 (16 bit) calibrated outdoor images (Formats: homebrew)
  • Institute of Computer Graphics and Vision
  • IEN Image Library - 1000+ images, mostly outdoor sequences (Formats: raw, ppm)
  • INRIA's Syntim images database - 15 color images of simple objects (Formats: gif)
  • INRIA
  • Image Analysis Laboratory - Images obtained from a variety of imaging modalities: raw CFA images, range images and a host of "medical images". (Formats: homebrew)
  • Image Database - An image database including some textures
  • ATR Research, Kyoto, Japan
  • Machine Vision - Images from the textbook by Jain, Kasturi, Schunck (20+ images) (Formats: GIF TIFF)
  • Mammography Image Databases - 100 or more images of mammograms with ground truth. Additional images available by request, and links to several other mammography databases are provided. (Formats: homebrew)
  • ftp://ftp.cps.msu.edu/pub/prip - many images (Formats: unknown)
  • Middlebury Stereo Data Sets with Ground Truth - Six multi-frame stereo data sets of scenes containing planar regions. Each data set contains 9 color images and subpixel-accuracy ground-truth data. (Formats: ppm)
  • Middlebury Stereo Vision Research Page - Middlebury College
  • Modis Airborne simulator, Gallery and data set - High Altitude Imagery from around the world for environmental modeling in support of NASA EOS program (Formats: JPG and HDF)
  • NIST Fingerprint and handwriting - datasets - thousands of images (Formats: unknown)
  • NLM HyperDoc Visible Human Project - Color, CAT and MRI image samples - over 30 images (Formats: jpeg)
  • OSU (MSU) 3D Object Model Database - several sets of 3D object models collected over several years to use in object recognition research (Formats: homebrew, vrml)
  • OSU (MSU/WSU) Range Image Database - Hundreds of real and synthetic images (Formats: gif, homebrew)
  • Signal Analysis and Machine Perception Laboratory
  • Otago Optical Flow Evaluation Sequences - Synthetic and real sequences with machine-readable ground truth optical flow fields, plus tools to generate ground truth for new sequences. (Formats: ppm,tif,homebrew)
  • LIMSI-CNRS
  • SEQUENCES FOR OPTICAL FLOW ANALYSIS (SOFA) - 9 synthetic sequences designed for testing motion analysis applications, including full ground truth of motion and camera parameters. (Formats: gif)
  • Computer Vision Group
  • Sequences for Flow Based Reconstruction - synthetic sequence for testing structure from motion algorithms (Formats: pgm)
  • Stereo Images with Ground Truth Disparity and Occlusion - a small set of synthetic images of a hallway with varying amounts of noise added. Use these images to benchmark your stereo algorithm. (Formats: raw, viff (khoros), or tiff)
  • Stuttgart Range Image Database - A collection of synthetic range images taken from high-resolution polygonal models available on the web (Formats: homebrew)
  • Purdue Robot Vision Lab
  • The MIT-CSAIL Database of Objects and Scenes - Database for testing multiclass object detection and scene recognition algorithms. Over 72,000 images with 2873 annotated frames. More than 50 annotated object classes. (Formats: jpg)
  • The RVL SPEC-DB (SPECularity DataBase) - A collection of over 300 real images of 100 objects taken under three different illumination conditions (Diffuse/Ambient/Directed). Use these images to test algorithms for detecting and compensating specular highlights in color images. (Formats: TIFF)
  • Robot Vision Laboratory
  • The Xm2vts database - The XM2VTSDB contains four digital recordings of 295 people taken over a period of four months. This database contains both image and video data of faces.
  • Traffic Image Sequences and 'Marbled Block' Sequence - thousands of frames of digitized traffic image sequences as well as the 'Marbled Block' sequence (grayscale images) (Formats: GIF)
  • U Bern Face images - hundreds of images (Formats: Sun rasterfile)
  • U Michigan textures (Formats: compressed raw)
  • UNC's 3D image database - many images (Formats: GIF)
  • USF Range Image Data with Segmentation Ground Truth - 80 image sets (Formats: Sun rasterimage)
  • University of Oulu Physics-based Face Database - contains color images of faces under different illuminants and camera calibration conditions as well as skin spectral reflectance measurements of each person.
  • Machine Vision and Media Processing Unit
  • University of Oulu Texture Database - Database of 320 surface textures, each captured under three illuminants, six spatial resolutions and nine rotation angles. A set of test suites is also provided so that texture segmentation, classification, and retrieval algorithms can be tested in a standard manner. (Formats: bmp, ras, xv)
  • Machine Vision Group
  • Usenix face database - Thousands of face images from many different sites (circa 1994)
  • View Sphere Database - Images of 8 objects seen from many different view points. The view sphere is sampled using a geodesic with 172 images/sphere. Two sets for training and testing are available. (Formats: ppm)
  • PRIMA, GRAVIR
  • Vision-list Imagery Archive - Many images, many formats
  • Wiry Object Recognition Database - Thousands of images of a cart, ladder, stool, bicycle, chairs, and cluttered scenes with ground truth labelings of edges and regions. (Formats: jpg)
  • 3D Vision Group
  • Yale Face Database - 165 images (15 individuals) with different lighting, expression, and occlusion configurations.
  • Center for Computational Vision and Control

Videos and Lectures

  • Natural Language Processing By Chris Manning in Stanford

Books

  • Deep Learning Tutorial by LISA lab, University of Montreal (Jan 6 2015)
  • neuraltalk (5.3k stars) by Andrej Karpathy: numpy-based RNN/LSTM implementation
  • Artificial Intelligence: A Modern Approach
  • Deep Learning in Neural Networks: An Overview

Researchers

Frameworks

  • MatConvNet: CNNs for MATLAB (1.3k stars)

Tutorials

  • VGG Convolutional Neural Networks Practical

Papers

  • Ask Me Anything: Dynamic Memory Networks for Natural Language Processing

Jul 27th - Aug 2nd, 2015

Researchers

Websites

  • Deep Learning News

Researchers

Frameworks

  • char-rnn (10.9k stars)