deep thinking hour

UvA · ELLIS · VISLab

a series of talks on Deep Learning by experts from industry and academia, hosted at the University of Amsterdam


upcoming events

tutorial

speaker

David W. Romero

from

NVIDIA

time

Thu 11.04.2024 09:00-11:00 CET


livestream

zoom link

title Beyond Transformers: Exploring Subquadratic Long-Context Architectures

abstract Transformers are powerful but challenging to scale to tasks with long context due to their quadratic computational cost in context length. This limitation has prompted the development of alternative architectures that scale subquadratically. This tutorial delves into recent developments in subquadratic long-context architectures, focusing on their foundations and mechanisms. We start with State-Space Models (SSMs), particularly the S4 model, which combines recurrence and convolution. We then explore convolutional models like Hyena, Orchid, and CKConv, which do not rely on the SSM formulation, as well as recent recurrent models like Mamba. After assessing the strengths and limitations of each model family, we conclude with a look at future research directions. Attendees will gain an understanding of modern subquadratic architectures and their significance for Deep Learning applications.
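
For readers new to the recurrence/convolution duality mentioned above, here is a minimal numpy sketch (an illustration, not the tutorial's material) of the linear state-space map behind S4-style models, evaluated two equivalent ways; all names, shapes, and values are chosen for illustration.

import numpy as np

# Toy linear state-space model: x_k = A x_{k-1} + B u_k,  y_k = C x_k.
# S4-style SSMs exploit the fact that this map can be run either step by
# step (a recurrence) or in one shot (a causal convolution).

def ssm_recurrent(A, B, C, u):
    """Sequential scan: one hidden-state update per input step."""
    x = np.zeros(A.shape[0])
    ys = []
    for u_k in u:
        x = A @ x + B * u_k
        ys.append(C @ x)
    return np.array(ys)

def ssm_convolutional(A, B, C, u):
    """The same map as a causal convolution with kernel K_j = C A^j B."""
    L = len(u)
    K = np.array([C @ np.linalg.matrix_power(A, j) @ B for j in range(L)])
    return np.convolve(u, K)[:L]

rng = np.random.default_rng(0)
N, L = 4, 16
A = 0.9 * np.eye(N)                              # stable toy dynamics
B, C = rng.normal(size=N), rng.normal(size=N)
u = rng.normal(size=L)
assert np.allclose(ssm_recurrent(A, B, C, u), ssm_convolutional(A, B, C, u))

Materializing K naively is expensive; S4's structured parameterization makes this kernel cheap to compute for training, while the recurrent view keeps autoregressive inference at constant cost per step.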

past events

tutorial

speaker

Phillip Lippe

from

VISLab, UvA

time

Mon 11.03.2024 17:00-19:00 CET


livestream

zoom link

resources

title Training models at scale

abstract This tutorial equips you with the knowledge to efficiently train large models 🔥. We'll explore various distributed training strategies, including fully-sharded data parallelism, pipeline parallelism, and tensor parallelism, alongside single-GPU optimizations such as mixed-precision training and gradient checkpointing. The tutorial is framework-agnostic, so no prior knowledge of JAX or PyTorch is needed. By the end, you'll have the skills to navigate the complexities of large-scale training.
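
As a concrete taste of the single-GPU optimizations named in the abstract, here is a short PyTorch sketch (my illustration under stated assumptions, not the tutorial's framework-agnostic material) combining mixed-precision autocast with gradient checkpointing; it assumes a CUDA device and uses toy layer sizes.

import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint

# Toy stack of blocks. Checkpointed blocks discard their activations in the
# forward pass and recompute them during backward, trading compute for memory.
blocks = nn.ModuleList(
    [nn.Sequential(nn.Linear(512, 512), nn.GELU()) for _ in range(8)]
).cuda()
opt = torch.optim.AdamW(blocks.parameters(), lr=1e-4)
scaler = torch.cuda.amp.GradScaler()  # loss scaling avoids fp16 gradient underflow

def forward(x):
    for block in blocks:
        x = checkpoint(block, x, use_reentrant=False)  # activations recomputed
    return x

x = torch.randn(32, 512, device="cuda")
with torch.autocast(device_type="cuda", dtype=torch.float16):
    loss = forward(x).square().mean()  # eligible ops run in half precision
scaler.scale(loss).backward()
scaler.step(opt)
scaler.update()

The same two ideas carry over to the distributed strategies above: sharding and pipelining decide where tensors live, while precision and checkpointing decide how much memory each device needs.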

tutorial

speaker

Alex Gabel

from

VISLab, UvA

time

Wed 06.03.2024 17:00-19:00 CET

title Differential geometry for deep learning

abstract This tutorial introduces differential manifolds to machine learning researchers, covering fundamental concepts such as charts, partitions of unity, and fiber bundles. Emphasizing the construction of global structures from local properties, particularly in Euclidean space, it addresses advanced topics like differential forms and integration, with applications to machine learning. Throughout, the tutorial underscores the importance of these mathematical tools for understanding complex data structures and improving modeling techniques, and it points to practical applications within the field.
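
For readers new to the vocabulary, the following LaTeX fragment states the standard textbook definitions the tutorial builds on (not taken from the tutorial slides):

% Standard definitions; textbook material, not the tutorial's notation.
A \emph{chart} on a topological space $M$ is a pair $(U, \varphi)$ with
$U \subseteq M$ open and $\varphi : U \to \varphi(U) \subseteq \mathbb{R}^n$
a homeomorphism. A family $\{(U_\alpha, \varphi_\alpha)\}$ covering
$M = \bigcup_\alpha U_\alpha$ is a \emph{smooth atlas} when every transition map
\[
  \varphi_\beta \circ \varphi_\alpha^{-1} :
  \varphi_\alpha(U_\alpha \cap U_\beta) \to \varphi_\beta(U_\alpha \cap U_\beta)
\]
is $C^\infty$. A \emph{partition of unity} subordinate to $\{U_\alpha\}$ is a
family of smooth maps $\rho_\alpha : M \to [0,1]$ with
$\operatorname{supp}\,\rho_\alpha \subset U_\alpha$ and
$\sum_\alpha \rho_\alpha \equiv 1$; it is the standard tool for gluing local
constructions into the global structures the abstract refers to.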

seminar

speaker

Rianne van den Berg

from

Microsoft Research

time

Wed 10.05.2023 16:00 CET

location

L3.36, Lab 42, Science Park, Amsterdam

title AI4Science at Microsoft Research

abstract In July 2022, Microsoft announced a new global team in Microsoft Research, spanning the UK, China, and the Netherlands, to focus on AI for science. In this talk I will discuss some of the research areas that we are currently exploring in AI4Science at Microsoft Research, covering topics such as drug discovery, material generation, neural PDE solvers, and electronic structure theory. I will then dive deeper into two examples of projects recently done at Microsoft Research.

seminar

speaker

David Ruhe

from

AMLab, UvA

time

Wed 01.03.2023 16:00 CET

location

L3.36, Lab 42, Science Park, Amsterdam

title Geometric Clifford Algebra Networks

abstract In this talk, I explain our recently proposed Geometric Clifford Algebra Networks (GCANs), which are based on symmetry group transformations using geometric (Clifford) algebras. GCANs are particularly well-suited for representing and manipulating the geometric transformations often found in dynamical systems. These theoretical advantages are strongly reflected in the modeling of three-dimensional rigid-body transformations as well as large-scale fluid dynamics simulations, where GCANs show significantly improved performance over traditional methods.
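
To give a flavor of the transformations involved, here is a minimal numpy sketch (my illustration, not the paper's code) of the rotor "sandwich" action v -> R v R~ in 3D, using the standard realization of the even subalgebra of Cl(3,0) as quaternions; all function names are hypothetical.

import numpy as np

# Quaternions (w, x, y, z) realize the rotors of Cl(3,0).
def quat_mul(p, q):
    pw, px, py, pz = p
    qw, qx, qy, qz = q
    return np.array([
        pw*qw - px*qx - py*qy - pz*qz,
        pw*qx + px*qw + py*qz - pz*qy,
        pw*qy - px*qz + py*qw + pz*qx,
        pw*qz + px*qy - py*qx + pz*qw,
    ])

def rotor_rotate(v, axis, angle):
    """Sandwich product v' = R v R~: the rotor action on a 3D vector."""
    axis = axis / np.linalg.norm(axis)
    R = np.concatenate([[np.cos(angle / 2)], np.sin(angle / 2) * axis])  # unit rotor
    R_rev = R * np.array([1.0, -1.0, -1.0, -1.0])                        # reversion
    v_q = np.concatenate([[0.0], v])                                     # embed vector
    return quat_mul(quat_mul(R, v_q), R_rev)[1:]

# Rotating e_x by 90 degrees about e_z yields e_y:
print(rotor_rotate(np.array([1.0, 0.0, 0.0]), np.array([0.0, 0.0, 1.0]), np.pi / 2))

Roughly speaking, GCAN layers parameterize networks with group elements like R and their compositions rather than unconstrained weight matrices, so geometric structure is preserved by construction.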

panel discussion

panelists

Jakub Tomczak (TUe), Yuki Asano (UvA), Efstratios Gavves (UvA), Emiel Hoogeboom (Google Brain)

time

Thu 19.01.2023 14:00 CET

location

L3.36, Lab 42, Science Park, Amsterdam

title Modelling versus scaling in modern Deep Learning

abstract What does it mean to model accurately using generative models? Is it about building informative representations of real-world data? Do generative models allow us to investigate questions and ideas about the world that we couldn't before? Recent foundation models, such as DALL-E, Imagen, ChatGPT, and GPT-4, seem to achieve incredible performance by leveraging enormous resources, both in terms of computation and data. What are the limits of such data and compute scaling? Should (academic) researchers focus their attention on better scaling algorithms? Is there even any role left for modelling through inductive biases in this era of large-scale models? All this and more will be covered in this first edition of our panel discussion format by an invited panel of influential researchers.

organisers

Samuele Papa s.papa@uva.nl

Riccardo Valperga r.valperga@uva.nl

David Knigge d.m.knigge@uva.nl

The Deep Thinking Hour is a series of talks and panel discussions on advancements in Deep Learning, hosted at the University of Amsterdam. This initiative is supported by ELLIS.