Georgia Tech Computer Vision Reading Group
Spring 2021
Date & Time: Tuesdays 3-4pm
Location: BlueJeans
To subscribe to the mailing list for presentation announcements, join the Google group. You can also add/import the Google calendar, which will be updated with the schedule.
Schedule
Date | Topic(s)/Paper(s) | Presenter(s) |
---|---|---|
Jan 26, 2021 | Self-Attention Based Context-Aware 3D Object Detection. Prarthana Bhattacharyya, Chengjie Huang, Krzysztof Czarnecki. arXiv preprint. [arXiv] | Ben |
Feb 2, 2021 | What is being transferred in transfer learning? Behnam Neyshabur*, Hanie Sedghi*, Chiyuan Zhang*. NeurIPS 2020. [arXiv] | Cusuh |
Feb 9, 2021 | Learning Transferable Visual Models from Natural Language Supervision. Alec Radford*, Jong Wook Kim*, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, Gretchen Krueger, Ilya Sutskever. 2021. [CLIP blog, paper] | Daniel |
Feb 16, 2021 | Solving Rubik's Cube with a Robot Hand. Ilge Akkaya, Marcin Andrychowicz, Maciek Chociej, Mateusz Litwin, Bob McGrew, Arthur Petron, Alex Paino, Matthias Plappert, Glenn Powell, Raphael Ribas, Jonas Schneider, Nikolas Tezak, Jerry Tworek, Peter Welinder, Lilian Weng, Qiming Yuan, Wojciech Zaremba, Lei Zhang. arXiv preprint. [blog, paper] | Angel |
Feb 23, 2021 | No meeting | -- |
Mar 2, 2021 | ICCV deadline approaching | -- |
Mar 9, 2021 | ICCV deadline approaching | -- |
Mar 16, 2021 | ICCV deadline approaching | -- |
Mar 23, 2021 | No meeting | -- |
Mar 30, 2021 | SuperGlue: Learning Feature Matching with Graph Neural Networks. Paul-Edouard Sarlin, Daniel DeTone, Tomasz Malisiewicz, Andrew Rabinovich. CVPR 2020. [arXiv]; Simple multi-dataset detection. Xingyi Zhou, Vladlen Koltun, Philipp Krähenbühl. arXiv preprint. [arXiv] | John |
Apr 6, 2021 | Barlow Twins: Self-Supervised Learning via Redundancy Reduction. Jure Zbontar*, Li Jing*, Ishan Misra, Yann LeCun, Stéphane Deny. arXiv preprint. [arXiv] | Sean |
Apr 13, 2021 | Cancelled | James |
Apr 20, 2021 | SCAN: Learning to Classify Images without Labels. Wouter Van Gansbeke*, Simon Vandenhende*, Stamatios Georgoulis, Marc Proesmans, Luc Van Gool. ECCV 2020. [arXiv] | Jon W. |
Apr 27, 2021 | | Patsorn |
Previous semesters
Fall 2020
Date | Topic(s)/Paper(s) | Presenter(s) |
---|---|---|
Aug 28, 2020 | Kick-off meeting | Cusuh |
Sep 4, 2020 | Highlights from CVPR 2020 & ECCV 2020 | Amit & John |
Sep 11, 2020 | 3D object detection | Ben |
Sep 18, 2020 | Out-of-distribution detection: Uncertainty Estimation Using a Single Deep Deterministic Neural Network. Joost van Amersfoort, Lewis Smith, Yee Whye Teh, Yarin Gal. ICML 2020. [arXiv] | Cusuh |
Sep 25, 2020 | Object-centric representations of scenes: MONet: Unsupervised Scene Decomposition and Representation. Christopher P. Burgess, Loic Matthey, Nicholas Watters, Rishabh Kabra, Irina Higgins, Matt Botvinick, Alexander Lerchner. arXiv preprint. [arXiv] | Niranjan |
Oct 2, 2020 | Language Models are Few-Shot Learners. Tom B. Brown*, Benjamin Mann*, Nick Ryder*, Melanie Subbiah*, Jared Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel M. Ziegler, Jeffrey Wu, Clemens Winter, Christopher Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, Dario Amodei. arXiv preprint. [arXiv]; Text-based image retrieval: Stacked Cross Attention for Image-Text Matching. Kuang-Huei Lee, Xi Chen, Gang Hua, Houdong Hu, Xiaodong He. ECCV 2018. [arXiv] | Patsorn |
Oct 9, 2020 | Understanding vision models. David Bau; Rewriting a Deep Generative Model. David Bau, Steven Liu, Tongzhou Wang, Jun-Yan Zhu, Antonio Torralba. ECCV 2020. [arXiv] | Joel |
Oct 16, 2020 | When will you do what? - Anticipating Temporal Occurrences of Activities. Yazan Abu Farha, Alexander Richard, Juergen Gall. CVPR 2018. [paper] | Dan |
Oct 23, 2020 | *** Time change to 12pm *** Adversarial Continual Learning. Sayna Ebrahimi, Franziska Meier, Roberto Calandra, Trevor Darrell, Marcus Rohrbach. ECCV 2020. [arXiv] | James |
Oct 30, 2020 | Cancelled | Aastha |
Nov 6, 2020 | CVPR deadline approaching | -- |
Nov 13, 2020 | CVPR deadline approaching | -- |
Nov 20, 2020 | CVPR supplementary deadline approaching | -- |
Nov 27, 2020 | Thanksgiving break | -- |
Dec 4, 2020 | An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale. Alexey Dosovitskiy*, Lucas Beyer*, Alexander Kolesnikov*, Dirk Weissenborn*, Xiaohua Zhai*, Thomas Unterthiner, Mostafa Dehghani, Matthias Minderer, Georg Heigold, Sylvain Gelly, Jakob Uszkoreit, Neil Houlsby*. arXiv preprint. [arXiv] | Sean |
Dec 11, 2020 | Transformers | Arjun |
Spring 2020
Date | Topic(s)/Paper(s) | Presenter(s) |
---|---|---|
Jan 13, 2020 | Kick-off meeting (Coda C1108 Brookhaven) | Cusuh |
Jan 22, 2020 | No meeting (previously cancelled for MLK Jr. Day) | -- |
Jan 29, 2020 | Cancelled | Amit |
Feb 5, 2020 | Uniform convergence may be unable to explain generalization in deep learning. Vaishnavh Nagarajan, J. Zico Kolter. NeurIPS 2019. [paper] | Nathan |
Feb 12, 2020 | Explainability: Explainable AI: Beware of Inmates Running the Asylum. Tim Miller, Piers Howe, Liz Sonenberg. IJCAI 2017 Workshop on Explainable AI. [arXiv] | Jon |
Feb 19, 2020 | Continual Unsupervised Representation Learning. Dushyant Rao, Francesco Visin, Andrei A. Rusu, Yee Whye Teh, Razvan Pascanu, Raia Hadsell. NeurIPS 2019. [paper] | James |
Feb 26, 2020 | ICCV deadline approaching | -- |
Mar 4, 2020 | ICCV deadline approaching | -- |
Mar 11, 2020 | Unsupervised/self-supervised learning of visual representations: Momentum Contrast for Unsupervised Visual Representation Learning. Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, Ross Girshick. arXiv preprint. [arXiv] | Arjun M. |
Mar 18, 2020 | Spring break | -- |
Mar 25, 2020 | *** Join via Hangouts *** Deep feature detectors: Key.Net: Keypoint Detection by Handcrafted and Learned CNN Filters. Axel Barroso-Laguna, Edgar Riba, Daniel Ponsa, Krystian Mikolajczyk. ICCV 2019. [arXiv]; Deep joint feature detector-descriptors: SuperPoint: Self-Supervised Interest Point Detection and Description. Daniel DeTone, Tomasz Malisiewicz, Andrew Rabinovich. CVPR 2018. [arXiv]; Deep feature descriptors: Universal Correspondence Network. Christopher B. Choy, JunYoung Gwak, Silvio Savarese, Manmohan Chandraker. NeurIPS 2016. [arXiv]; Deep putative correspondence verification: Learning to Find Good Correspondences. Kwang Moo Yi*, Eduard Trulls*, Yuki Ono, Vincent Lepetit, Mathieu Salzmann, Pascal Fua. CVPR 2018. [arXiv] | John |
Apr 1, 2020 | Cancelled | Meera |
Apr 8, 2020 | *** Join via BlueJeans *** Out-of-distribution detection: Do Deep Generative Models Know What They Don't Know? Eric Nalisnick, Akihiro Matsukawa, Yee Whye Teh, Dilan Gorur, Balaji Lakshminarayanan. ICLR 2019. [arXiv] | Cusuh |
Apr 15, 2020 | *** Join via BlueJeans *** Learning Compositional Representations for Few-Shot Recognition. Pavel Tokmakov, Yu-Xiong Wang, Martial Hebert. ICCV 2019. [arXiv] | Sean |
Apr 22, 2020 | Reading period | -- |
Apr 29, 2020 | *** Join via BlueJeans *** Defense practice talk | Samarth |
Fall 2019
Date | Topic(s)/Paper(s) | Presenter(s) |
---|---|---|
Aug 28, 2019 | Kick-off meeting | Cusuh |
Sep 4, 2019 | CVPR recap. Vision for self-driving: Panoptic Segmentation. Alexander Kirillov, Kaiming He, Ross Girshick, Carsten Rother, Piotr Dollár. CVPR 2019. [paper]; Incorporating 3D information in networks: 4D Spatio-Temporal ConvNets: Minkowski Convolutional Neural Networks. Christopher Choy, JunYoung Gwak, Silvio Savarese. CVPR 2019. [paper]; Generative models: A Style-Based Generator Architecture for Generative Adversarial Networks. Tero Karras, Samuli Laine, Timo Aila. CVPR 2019. [arXiv] | Amit & John |
Sep 11, 2019 | Tracking: Tracking without bells and whistles. Philipp Bergmann*, Tim Meinhardt*, Laura Leal-Taixe. arXiv preprint 2019. [arXiv] | Sean |
Sep 18, 2019 | Uncertainty: Modeling Uncertainty with Hedged Instance Embedding. Seong Joon Oh, Kevin Murphy, Jiyan Pan, Joseph Roth, Florian Schroff, Andrew Gallagher. ICLR 2019. [arXiv] | Cusuh |
Sep 25, 2019 | Object detection: Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. Shaoqing Ren, Kaiming He, Ross Girshick, Jian Sun. NeurIPS 2015. [arXiv]; Real-time object detection: SSD: Single Shot MultiBox Detector. Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg. ECCV 2016. [arXiv]; Instance segmentation: Fully Convolutional Instance-aware Semantic Segmentation. Yi Li*, Haozhi Qi*, Jifeng Dai, Xiangyang Ji, Yichen Wei. CVPR 2017. [arXiv]; Real-time instance segmentation: YOLACT: Real-Time Instance Segmentation. Daniel Bolya, Chong Zhou, Fanyi Xiao, Yong Jae Lee. ICCV 2019. [arXiv] | Daniel |
Oct 2, 2019 | PU-GAN: a Point Cloud Upsampling Adversarial Network. Ruihui Li, Xianzhi Li, Chi-Wing Fu, Daniel Cohen-Or, Pheng-Ann Heng. ICCV 2019. [arXiv] | Patsorn |
Oct 9, 2019 | Invariant Information Clustering for Unsupervised Image Classification and Segmentation. Xu Ji, João F. Henriques, Andrea Vedaldi. ICCV 2019. [arXiv] | James |
Oct 16, 2019 | Learning textures for meshes: Texture Fields: Learning Texture Representations in Function Space. Michael Oechsle, Lars Mescheder, Michael Niemeyer, Thilo Strauss, Andreas Geiger. ICCV 2019. [paper]; PIFu: Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization. Shunsuke Saito*, Zeng Huang*, Ryota Natsume*, Shigeo Morishima, Angjoo Kanazawa, Hao Li. ICCV 2019. [arXiv]; Temporal deformation on meshes: Occupancy Flow: 4D Reconstruction by Learning Particle Dynamics. Michael Niemeyer, Lars Mescheder, Michael Oechsle, Andreas Geiger. ICCV 2019. [paper]; Function-based 3D representations: Occupancy Networks: Learning 3D Reconstruction in Function Space. Lars Mescheder, Michael Oechsle, Michael Niemeyer, Sebastian Nowozin, Andreas Geiger. CVPR 2019. [arXiv]; DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation. Jeong Joon Park, Peter Florence, Julian Straub, Richard Newcombe, Steven Lovegrove. CVPR 2019. [arXiv] | Amit |
Oct 23, 2019 | Title: Learning to Learn More with Less. Abstract: Understanding how humans and machines learn from few examples remains a fundamental challenge. Humans are remarkably able to grasp a new concept from just a few examples, or learn a new skill from just a few trials. By contrast, state-of-the-art machine learning techniques typically require thousands of training examples and often break down if the training sample set is too small. In this talk, I will discuss our efforts towards endowing visual learning systems with few-shot learning ability. Our key insight is that the visual world is well structured and highly predictable not only in feature spaces but also in under-explored model and data spaces. Such structures and regularities enable the systems to learn how to learn new tasks rapidly by reusing previous experiences. I will focus on a few topics to demonstrate how to leverage this idea of learning to learn, or meta-learning, to address a broad range of few-shot learning tasks: meta-learning in model space and task-oriented generative modeling. I will also discuss some ongoing work towards building machines that are able to operate in highly dynamic and open environments, making intelligent and independent decisions based on insufficient information. Bio: Yuxiong Wang is a postdoctoral fellow in the Robotics Institute at Carnegie Mellon University. He received a Ph.D. in robotics in 2018 from Carnegie Mellon University. His research interests lie in the intersection of computer vision, machine learning, and robotics, with a particular focus on few-shot learning and meta-learning. He has spent time at Facebook AI Research (FAIR). | Guest speaker: Yuxiong Wang |
Oct 30, 2019 | Learning Language Games through Interaction. Sida I. Wang, Percy Liang, Christopher D. Manning. ACL 2016. [arXiv] | Arjun C. |
Nov 6, 2019 | CVPR deadline approaching | -- |
Nov 13, 2019 | CVPR deadline approaching | -- |
Nov 20, 2019 | Fashion++: Minimal Edits for Outfit Improvement. Wei-Lin Hsiao, Isay Katsman, Chao-Yuan Wu, Devi Parikh, Kristen Grauman. ICCV 2019. [arXiv] | Meera |
Nov 27, 2019 | Thanksgiving break | -- |
Dec 4, 2019 | No meeting | -- |
Dec 11, 2019 | Final exams | -- |
Spring 2019
Date | Paper(s) | Presenter |
---|---|---|
Jan 14, 2019 | Kick-off meeting | Cusuh |
Jan 21, 2019 | MLK Jr. Day | -- |
Jan 28, 2019 | First-Person Hand Action Benchmark with RGB-D Videos and 3D Hand Pose Annotations. Guillermo Garcia-Hernando, Shanxin Yuan, Seungryul Baek, Tae-Kyun Kim. CVPR 2018. [paper]; Synthesis of Detailed Hand Manipulations Using Contact Sampling. Yuting Ye, C. Karen Liu. SIGGRAPH 2012. [paper]; V2V-PoseNet: Voxel-to-Voxel Prediction Network for Accurate 3D Hand and Human Pose Estimation from a Single Depth Map. Gyeongsik Moon, Ju Yong Chang, Kyoung Mu Lee. CVPR 2018. [paper]; Hand Pose Estimation via Latent 2.5D Heatmap Regression. Umar Iqbal, Pavlo Molchanov, Thomas Breuel. ECCV 2018. [arXiv] | Samarth B. |
Feb 4, 2019 | Realistic Evaluation of Semi-Supervised Learning Algorithms. Avital Oliver*, Augustus Odena*, Colin Raffel*, Ekin D. Cubuk, Ian J. Goodfellow. NeurIPS 2018. [arXiv] | Cusuh |
Feb 11, 2019 | Memory Aware Synapses: Learning what (not) to forget. Rahaf Aljundi, Francesca Babiloni, Mohamed Elhoseiny, Marcus Rohrbach, Tinne Tuytelaars. ECCV 2018. [arXiv]; An Empirical Study of Example Forgetting during Deep Neural Network Learning. Mariya Toneva*, Alessandro Sordoni*, Remi Tachet des Combes, Adam Trischler, Yoshua Bengio, Geoffrey J. Gordon. ICLR 2019. [arXiv] | Stefan |
Feb 18, 2019 | CodeSLAM -- Learning a Compact, Optimisable Representation for Dense Visual SLAM. Michael Bloesch, Jan Czarnowski, Ronald Clark, Stefan Leutenegger, Andrew J. Davison. CVPR 2018. [paper]; Object-Centric Photometric Bundle Adjustment with Deep Shape Prior. Rui Zhu, Chaoyang Wang, Chen-Hsuan Lin, Ziyan Wang, Simon Lucey. WACV 2018. [paper] | John |
Feb 25, 2019 | End-to-end Recovery of Human Shape and Pose. Angjoo Kanazawa, Michael J. Black, David W. Jacobs, Jitendra Malik. CVPR 2018. [paper]; SFV: Reinforcement Learning of Physical Skills from Video. Xue Bin Peng, Angjoo Kanazawa, Jitendra Malik, Pieter Abbeel, Sergey Levine. SIGGRAPH Asia 2018. [paper] | Amit |
Mar 4, 2019 | Semi-Parametric Topological Memory for Navigation. Nikolay Savinov*, Alexey Dosovitskiy*, Vladlen Koltun. ICLR 2018. [arXiv]; Taking a Deeper Look at the Inverse Compositional Algorithm. Zhaoyang Lv, Frank Dellaert, James M. Rehg, Andreas Geiger. CVPR 2019. [arXiv] | Apoorva |
Mar 11, 2019 | ICCV deadline approaching | -- |
Mar 18, 2019 | ICCV deadline approaching | -- |
Mar 25, 2019 | ICCV supplementary deadline approaching | -- |
Apr 1, 2019 | LaserNet: An Efficient Probabilistic 3D Object Detector for Autonomous Driving. Gregory P. Meyer*, Ankit Laddha*, Eric Kee, Carlos Vallespi-Gonzalez, Carl K. Wellington. CVPR 2019. [arXiv] | Patsorn |
Apr 8, 2019 | SlowFast Networks for Video Recognition. Christoph Feichtenhofer, Haoqi Fan, Jitendra Malik, Kaiming He. ICCV 2019. [arXiv] | Sean |
Apr 15, 2019 | Integrating New Knowledge into a Neural Network without Catastrophic Interference: Computational and Theoretical Investigations in a Hierarchically Structured Environment. 11:15am, EBB 1005. [GT Neuro Seminar calendar] | Dr. James L. McClelland, Stanford University |
Apr 22, 2019 | DeepSDF: Learning Continuous Signed Distance Functions for Shape Representation. Jeong Joon Park, Peter Florence, Julian Straub, Richard Newcombe, Steven Lovegrove. CVPR 2019. [arXiv] | Samarth M. |
Apr 29, 2019 | Final exams | -- |
Fall 2018
Date | Paper(s) | Presenter |
---|---|---|
Aug 29, 2018 | Kick-off meeting | Cusuh |
Sep 6, 2018 | Low-Shot Learning with Imprinted Weights. Hang Qi, Matthew Brown, David G. Lowe. CVPR 2018. [paper] | Jon |
Sep 13, 2018 | ECCV | -- |
Sep 20, 2018 | CornerNet: Detecting Objects as Paired Keypoints. Hei Law, Jia Deng. ECCV 2018. [arXiv] | Cusuh |
Sep 27, 2018 | Implicit 3D Orientation Learning for 6D Object Detection from RGB Images. Martin Sundermeyer, Zoltan-Csaba Marton, Maximilian Durner, Manuel Brucker, Rudolph Triebel. ECCV 2018. [paper]; DeepIM: Deep Iterative Matching for 6D Pose Estimation. Yi Li, Gu Wang, Xiangyang Ji, Yu Xiang, Dieter Fox. ECCV 2018. [arXiv] | Ren |
Oct 4, 2018 | PWC-Net: CNNs for Optical Flow Using Pyramid, Warping, and Cost Volume. Deqing Sun, Xiaodong Yang, Ming-Yu Liu, Jan Kautz. CVPR 2018. [arXiv] | Sean |
Oct 11, 2018 | Eigendecomposition-free Training of Deep Networks with Zero Eigenvalue-based Losses. Zheng Dang, Kwang Moo Yi, Yinlin Hu, Fei Wang, Pascal Fua, Mathieu Salzmann. ECCV 2018. [arXiv]; Learning to Find Good Correspondences. Kwang Moo Yi*, Eduard Trulls*, Yuki Ono, Vincent Lepetit, Mathieu Salzmann, Pascal Fua. CVPR 2018. [arXiv] | Samarth |
Oct 18, 2018 | Progressive Neural Architecture Search. Chenxi Liu, Barret Zoph, Maxim Neumann, Jonathon Shlens, Wei Hua, Li-Jia Li, Li Fei-Fei, Alan Yuille, Jonathan Huang, Kevin Murphy. ECCV 2018. [arXiv] | Nam |
Oct 25, 2018 | Neural 3D Mesh Renderer. Hiroharu Kato, Yoshitaka Ushiku, Tatsuya Harada. CVPR 2018. [arXiv]; Learning Category-Specific Mesh Reconstruction from Image Collections. Angjoo Kanazawa*, Shubham Tulsiani*, Alexei A. Efros, Jitendra Malik. ECCV 2018. [arXiv] | Amit |
Nov 1, 2018 | Informative Features for Model Comparison. Wittawat Jitkrittum, Heishiro Kanagawa, Patsorn Sangkloy, James Hays, Bernhard Schölkopf, Arthur Gretton. NeurIPS 2018. [arXiv] | Patsorn |
Nov 8, 2018 | CVPR deadline approaching | -- |
Nov 15, 2018 | CVPR deadline approaching | -- |
Nov 22, 2018 | Thanksgiving | -- |
Nov 29, 2018 | No meeting | -- |
Dec 6, 2018 | Final exams | -- |