Seminar on Current Works in Computer Vision

Prof. Thomas Brox

The goal of Computer Vision is to imitate the flexibility and robustness of the human visual system and has a large impact on the success of artificial intelligence in general. Research has made significant progress in recent years particularly due to deep learning. Meanwhile computer vision is interweaved with other scopes of machine learning. In this seminar we will take a detailed look at various recent papers that span a broad range of hot topics. This includes particularly the emergence of properties from self-supervised learning and their analysis.
For each paper there will be three persons, who perform a more detailed investigation of the research paper and its background, and who will give a presentation. The presentation is followed by a discussion with all participants about the merits, limitations, and perspectives of the respective paper. You will learn to read and understand contemporary research papers, to give a good oral presentation, to ask questions, and to openly discuss a research problem.

Note that the mode of the seminar changes this semester to accomodate more slots for students. Rather than one student presenting a paper, three students will cover three different aspects, typically (but not necessarily) (1) the paper's historical background, related work, and motivation, (2) the core methodology including background methodology, and (3) experimental results and their assessment. In order to provide a strong overall presentation, you will also learn to work in a team, the assembly of which is out of your control. The success of the team effort is part of the grading.

Seminar: (2 SWS)	Wednesdays, 14:00-15:30, 101-00-017/019 Contact person: Karim Farid
Beginning:	If you want to participate, attend the mandatory introduction meeting on April 22 14:00, register in HisInOne, and submit your paper preferences by April 27.
Recommended semester:	6 (Bachelor), any (Master)
Requirements:	Background in computer vision
Remarks:	The language in this course is English.
	There is a strongly related Blockseminar on Deep Learning offered by apl Prof. Olaf Ronneberger from Google DeepMind. The introduction meeting will be jointly for both seminars. Topics will be assigned for both seminars via a preference voting. Only students who attended the introduction meeting and sent their paper preferences in time will be considered. Within this group, we follow the assignments of the HisInOne system. We want to avoid that people grab a topic and then jump off during the semester. Thus, please have a coarse look at all available papers to make an informed decision before you commit. The listed papers are not yet sorted by the date of presentation.If you don't attend the introduction meeting (or not send a paper preference) but choose this seminar together with only other overbooked seminars in HisInOne, you may end up without a seminar place this semester. All participants must read all papers and must attend all sessions.

Material:

Giving a good presentation
Proper scientific behavior

Powerpoint template for your presentation (optional)

Papers:

Date	Paper	Presenting students	Slides	Advisor

3.6.	V-JEPA	Alexander Paulus Ali Safarli Niklas Rodenbüsch		Johannes Dienert
10.6.	Self-Flow	Ali Karataş Maximilian Singler Paul Meissner		Arian Mousakhan
17.6.	Block-recurrent transformers	Brunda Mokshagundam Subramanya Felix Christian Johannes Falk-Bruchhäuser Kartik Singh		Simon Schrodi
24.6.	Reasoning with mental imagary	Nattakarn Johnpradit Ojas Dillikar Tobias Schmidt		Jelena Bratulić
1.7.	Video generation with internal memory	Johannes Oskar Effenberger Kevin Niedermaier Zhipeng Guo		Silvio Galesso
8.7.	Multimodal world models	Arnav Verma Mazen Amria Tobias Heimbach		Leonhard Sommer
22.7.	In-context algebra	Amy George Jilong Xie Muhammad Shaafay Saqib		Elias Kempf