ABCDEFGHIJKLMNOPQRSTUVWX
1
Neuroscience-Inspired Artificial Intelligence (Hassabis overview article and outlook onto Deep Learning)
2
SIPB Deep Learning Group - code implementations
3
4
yellow cell = assigned paper
5
green cell = free to select paper (recommended)
6
white cell = free to select paper
7
8
ROBOTS BOOTSTRAPPED THROUGH LEARNING FROM EXPERIENCE (>200 papers)
9
SFM-Visual-SLAM (a number of different implementations)
10
11
392247 ISY Project: Deep Learning Architectures for AI (Pj) (SoSe 2019)
12
Logic-Geometric Programming: An Optimization-Based Approach to Combined Task and Motion Planninghttps://www.youtube.com/watch?v=B2s85xfo2uE
13
Deep, Probabilistic and Semantic 3D Reconstructionhttps://github.com/paschalidoud/raynet
14
Superquadrics Revisited: Learning 3D Shape Parsing beyond Cuboids
15
MOTS: Multi-Object Tracking and Segmentationhttps://github.com/VisualComputingInstitute/mots_tools
16
Deep Marching Cubes: Learning Explicit Surface Representationshttps://www.youtube.com/watch?v=vhrvl9qOSKM
17
UnDeepVO: Monocular Visual Odometry through Unsupervised Deep Learninghttps://github.com/drmaj/UnDeepVO
18
SfM-Net: Learning of Structure and Motion from Videohttps://github.com/waxz/sfm_net
19
DeMoN: Depth and Motion Network for Learning Monocular Stereohttps://github.com/lmb-freiburg/demon
20
Real-time trajectory replanning for MAVs using uniform B-splines and a 3D circular bufferhttps://vision.in.tum.de/research/robotvision/replanninghttps://github.com/VladyslavUsenko/ewok
21
OctoMap: an efficient probabilistic 3D mapping framework based on octreeshttps://octomap.github.io
22
Learning monocular visual odometry with dense 3D mapping from dense 3D flowhttps://youtu.be/Ccj1O7yndIk?t=80
23
Deep Auxiliary Learning for Visual Localization and Odometry
24
FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networkshttps://github.com/search?q=FlowNet+2.0https://youtu.be/JSzUdVBmQP4
25
Goal directed dynamics
26
Exploration by Random Network Distillationhttps://github.com/openai/random-network-distillation
https://openai.com/blog/reinforcement-learning-with-prediction-based-rewards/
27
Deep Q-learning from DemonstrationsRoman Heinrichhttps://ai.google/research/pubs/pub46980https://github.com/go2sea/DQfD
https://github.com/search?q=Deep+Q-Learning+from+demonstrations&type=Repositories
28
Deep Reinforcement Learning that Matters
29
Learning Latent Dynamics for Planning from Pixelshttps://www.shortscience.org/paper?bibtexKey=journals/corr/1811.04551#wassnamehttp://www.arxiv-sanity.com/1811.04551
30
Adam: A method for stochastic optimizationRoman Heinrich
31
The option-critic architecture
32
Interpretable Latent Spaces for Learning from Demonstration
33
Conditional Affordance Learning for Driving in Urban Environmentshttp://www.cvlibs.net/publications/Sauer2018CORL_supplementary.pdfhttp://www.youtube.com/watch?v=UtUbpigMgr0
34
35
36
TOPIC: NIPS 2018
37
Learning to Control Visual Abstractions for Structured Exploration in Deep Reinforcement Learning
38
Artificial Intelligence for Prosthetics — challenge solutions (NeurIPS 2018)https://www.crowdai.org/challenges/neurips-2018-ai-for-prosthetics-challenge
39
Recurrent World Models Facilitate Policy Evolution
40
41
42
TOPIC: DRL WS2018
43
Terrain-Adaptive Locomotion Skills Using Deep Reinforcement Learning
44
Towards Generalization and Simplicity in Continuous Control
45
PVEs: Position-Velocity Encoders for Unsupervised Learning of Structured State Representations
46
MuJoCo: A physics engine for model-based control
47
48
TOPIC 1: discrete action space - Atari games
49
Imagination-Augmented Agents for Deep Reinforcement LearningLennart Bramlagehttps://github.com/yilundu/imagination_augmented_agents
50
Reinforcement Learning with Unsupervised Auxiliary TasksHendric Vosshttps://youtu.be/-YiMVR3HEuYhttps://github.com/miyosuda/unreal
https://github.com/NoobFang/multi-process-UNREAL
51
TreeQN and ATreeC: Differentiable Tree-Structured Models for Deep Reinforcement LearningLukas Hindemith (AML)https://github.com/oxwhirl/treeqn/
52
Curiosity-driven Exploration by Self-supervised PredictionSunkara Bhargavhttps://pathak22.github.io/noreward-rl
https://youtu.be/_Z9ZP1eiKsI
53
Progressive Neural NetworksFederico Rossettohttps://github.com/synpon/prog_nnhttps://github.com/howland/DQN_PNN
54
Model-Free Episodic ControlAleksandrs Stierhttps://github.com/sudeepraja/Model-Free-Episodic-Control
https://github.com/ShibiHe/Model-Free-Episodic-Control
55
Dueling Network Architectures for Deep Reinforcement LearningAndrewhttps://youtu.be/Ma1b6EeHlV0https://github.com/gokhanettin/dddqn-tf
https://www.youtube.com/results?search_query=Dueling+Network+Architectures+for+Deep+Reinforcement+Learning&page=&utm_source=opensearch
56
Metacontrol for adaptive imagination-based optimizationhttps://github.com/deepmind/spaceship_dataset
57
Discovering objects and their relations from entangled scene representations
58
Action-Conditional Video Prediction Using Deep Networks in Atari GamesSimon Müller-Cleve
59
Attend, Adapt and Transfer: Attentive Deep Architecture for Adaptive Transfer from multiple sources in the same domain
60
Neural Episodic Control
61
Variational Intrinsic Control ICLR17W
62
Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivationFederico Rossetto 190117https://github.com/mrkulk/hierarchical-deep-RLhttp://mrkulk.github.io/notes/deephrl
https://www.reddit.com/r/MachineLearning/comments/4frm32/160406057_hierarchical_deep_reinforcement/
63
Deep Reinforcement Learning Discovers Internal Models
64
The Reactor: A sample-efficient actor-critic architecture
65
Deep successor reinforcement learninghttps://github.com/Ardavans/DSR
66
The predictron: End-to-end learning and planning
67
The Option-Critic Architecture
68
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
69
FeUdal Networks for Hierarchical Reinforcement LearningLennart Bramlage 190117https://github.com/dmakian/feudal_networks
70
Value Prediction NetworkNIPS 2017
71
Interaction Networks for Learning about Objects, Relations and Physicshttps://github.com/jaesik817/Interaction-networks_tensorflow
72
Strategic Attentive Writer for Learning Macro-Actions
73
Deep Exploration via Bootstrapped DQN
https://www.youtube.com/watch?v=6SAdmG3zAMg&t=0s&index=1&list=PLdy8eRAW78uLDPNo1jRv8jdTx7aup1ujM
74
Independently Controllable Features
75
Asynchronous Methods for Deep Reinforcement LearningTimo Weike
76
Behavior is Everything – Towards Representing Concepts with Sensorimotor Contingencieshttps://github.com/vicariousinc/pixelworld
77
Visual Analogies between Atari Games for Studying Transfer Learning in RLICLR18W - Rejected
78
Schema Networks: Zero-shot Transfer with a Generative Causal Model of Intuitive PhysicsNO CODE IMPLEMENTATIONhttps://github.com/vicariousinc/schema-games
79
80
81
TOPIC 2: high-dimensional continuous control problems - Locomotion
82
Proximal Policy Optimization AlgorithmsHendrik Lücking
83
High-Dimensional Continuous Control Using Generalized Advantage EstimationBalavivek Sivananthamhttps://github.com/Anjum48/rl-examples/tree/56aca982fcf4426c02aa7e5fb58a4f8affab8020
https://github.com/ray-project/ray/tree/master/python/ray/rllib
https://github.com/search?q=Generalized+Advantage+Estimation&ref=opensearch&type=Code
https://github.com/search?p=2&q=HIGH-DIMENSIONAL+CONTINUOUS+CONTROL+USING+GENERALIZED+ADVANTAGE+ESTIMATION&ref=opensearch&type=Code
https://www.youtube.com/results?search_query=Generalized+Advantage+Estimation&page=&utm_source=opensearch
https://www.youtube.com/watch?v=jymFj7bNsKg
84
Continuous deep q-learning with model-based accelerationSebastian Muellerhttps://github.com/carpedm20/NAF-tensorflow https://github.com/semueller/NAF-tensorflow
85
Learning Robust Rewards with Adversarial Inverse Reinforcement LearningMarkus Vieth
86
Meta learning shared hierarchiesJan Eberthttps://github.com/openai/mlsh
87
Continuous control with deep reinforcement learningAleksandrs Stier (robotic-arms)https://github.com/openai/baselines/tree/master/baselines/her
88
DARLA: Improving Zero-Shot Transfer in Reinforcement LearningLuca Lach
89
On the Continuity of Rotation Representations in Neural Networks
90
Divide-and-Conquer Reinforcement Learning
91
Emergence of Locomotion Behaviours in Rich Environmentshttps://youtu.be/hx_bgoTF7bs
92
Evolution Strategies as a Scalable Alternative to Reinforcement Learning
93
Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines
94
95
96
TOPIC 3: VAE
97
World ModelsSebastianhttps://github.com/AppliedDataSciencePartners/WorldModels
98
The Dreaming Variational Autoencoder for Reinforcement Learning EnvironmentsSebastian
99
Stable Reinforcement Learning with Autoencoders for Tactile and Visual DataSimon Müller-Cleve
100
Vote3Deep: Fast Object Detection in 3D Point Clouds Using Efficient Convolutional Neural Networks