accepted tutorials

1. Tiny Titans: Efficient Large Vision, Language and Multimodal Models through Pruning

September 30th, 2025
Time: 09:00 am
Duration: 3h
Level: Intermediate

Notable progress in solving complex reasoning tasks relies on large models. Unfortunately, developing these models demands substantial computational resources and energy consumption. Hence, the industry pushes the most significant advances in state-of-the-art models and draws the attention of the scientific community to the environmental impact of AI (GreenAI). Pruning emerges as an effective mechanism to address the capacity-computational cost dilemma by eliminating structures (weights, neurons or layers) from deep models. This tutorial introduces theoretical and technical foundations within this promising, active and exciting field. It delves into pruning techniques as a pillar of GreenAI and a foundation for the next wave of efficient large vision, language, and multimodal models. Our tutorial also covers how existing forms of pruning impact efficiency gains, guiding participants to make informed choices for their scenario and infrastructure. Specifically, we equip participants with the basics and key recipes to effectively apply pruning in practical computer vision scenarios. Given the calls for efficient general-purpose AI models, we believe our tutorial serves as a valuable tool for practitioners.

Carolina Tavares

Universidade de São Paulo

Leandro Mugnain

Universidade de São Paulo

Gustavo Nascimento

Universidade de São Paulo

Ian Pons

Universidade de São Paulo

Keith Ogawa

Universidade de São Paulo

Guilherme Stern

Universidade de São Paulo

Lucas Libanio

Universidade de São Paulo

Aline Paes

Universidade Federal Fluminense

Anna Helena Reali

Universidade de São Paulo

Artur Jordao

Universidade de São Paulo

2. Rank-based Unsupervised Similarity Learning: Framework and Applications

September 30th, 2025
Time: 09:00 am
Duration: 3h
Level: Intermediate

Multimedia data collections have grown exponentially due to advances in acquisition and sharing technologies, creating a pressing need for robust similarity learning methods that do not depend on costly annotations. Deep features extracted by convolutional and transformer networks provide powerful embeddings, but often lie on complex manifolds that simple pairwise comparisons fail to capture. Unsupervised similarity learning addresses this gap by post-processing these embeddings with rank-based strategies that leverage the ordering of neighbors and manifold geometry through different approaches such as graph and hypergraph constructions. This tutorial discusses foundational concepts in contextual similarity and presents a comprehensive overview of rank-based methods for unsupervised distance and similarity learning. It introduces the Unsupervised Distance Learning Framework (UDLF) along with its Python wrapper pyUDLF and the UDLFWeb interface to automate and facilitate experimentation and visualization of results. We show how these tools enhance performance in image retrieval, classification, and clustering tasks and discuss future directions for extending rank-based methods to other modalities and scaling to large datasets.

Italo de Matos Saldanha

Universidade de São Paulo

Vinicius Atsushi Sato Kawai

UNESP

Bionda Rozi

UNESP

Gustavo Rosseto Leticio

UNESP

Lucas Pascotti Valem

Universidade de São Paulo

Daniel Carlos Guimarães Pedronette

UNESP

3. The Flow of Creation: A Tour of Flow Matching for Visuals

September 30th, 2025
Time: 09:00 am
Duration: 3h
Level: Advanced

Flow Matching has recently emerged as an efficient alternative to the generative method paradigms. Here we aim to provide researchers and practitioners with both the theoretical and the practical aspects of this technique. We delve into the continuous formulation, where a neural network learns a vector field to transform noise into data via an ordinary differential equation, and also explore its discrete counterpart. The paper covers the entire workflow, from the core mathematical concepts and training objectives to sampling procedures, including classifier-free guidance and conditioning generation. By showcasing a diverse range of applications—from image synthesis and human motion generation to computational biology and robotics—this work equips readers with the essential knowledge to apply and innovate with the versatile and computationally efficient flow matching framework.

Manolo Canales Cuba

Universidade Federal do ABC

Vinicius Melício

Universidade Federal do ABC

João Paulo Gois

Universidade Federal do ABC

4. Getting Started with Semantic Segmentation in PyTorch Using SMP

September 30th, 2025
Time: 01:00 pm
Duration: 3h
Level: Intermediate

Semantic segmentation is a core task in computer vision, essential for applications requiring detailed scene understanding, such as medical imaging, precision agriculture, and remote sensing. Recent advances in deep learning have significantly enhanced segmentation performance, particularly through encoder-decoder architectures combined with transfer learning. This tutorial provides a practical introduction to semantic segmentation using the Segmentation Models PyTorch (SMP) library, a widely adopted framework that integrates state-of-the-art architectures with pretrained encoders in an accessible interface. We offer a comprehensive overview of key concepts, supported model architectures, loss functions, evaluation metrics, and training strategies, emphasizing transparency and flexibility through native PyTorch implementations. To reinforce the concepts, we present two case studies: binary segmentation with the 38-Cloud dataset and multiclass segmentation with the DeepGlobe dataset. Both illustrate real-world applications, model configuration, preprocessing, and performance evaluation. All tutorial materials, including source code and reproducible experiments, will be made publicly available. The goal is to equip participants with practical knowledge to design, train, and evaluate semantic segmentation models effectively in a variety of domains.

Joao Mari

UFV

Leandro Silva

Federal University of Viçosa

Mauricio Escarpinati

Federal University of Uberlândia

André Backes

Federal University of São Carlos

5. Regularization for Inverse Problems and Machine Learning: A Tutorial

September 30th, 2025
Duration: 3h
Time: 01:00 pm
Level: Intermediate

In this tutorial, we explore different paradigms for solving image processing tasks such as denoising and deblurring. These tasks can be interpreted as ill-posed inverse problems, grounded in physical-mathematical models that require regularization to produce meaningful solutions. Alternatively, they can be framed as supervised regression tasks within a machine learning context, where regularization techniques are used to improve generalization to unseen data. We compare these two perspectives, highlighting their similarities and differences—particularly in how regularization is understood and applied in each framework.

Roberto Gutierrez Beraldo

CECS/UFABC

Leonardo Alves Ferreira

Samsung

Ricardo Suyama

CECS/UFABC

6. Symmetry Shape Analysis

September 30th, 2025
Time: 01:00 pm
Duration: 3h
Level: Intermediate

Symmetry is a fundamental and pervasive property found in both natural and man-made objects, playing a key role in aesthetics, structure, and function. In computational domains, symmetry serves as a powerful cue for data compression, structure inference, and shape understanding. This work presents a comprehensive overview of symmetry analysis in 3D shapes, with a particular focus on computational methods for symmetry detection and their applications in diverse fields such as CAD, computer vision, medicine, archaeology, and 3D modeling. We provide formal definitions of exact, approximate, and partial symmetries in the context of rigid transformations, and we survey five major categories of detection approaches: transformation-based, correspondence-based, voting-based, optimization-based, and learning-based methods. Special emphasis is placed on recent deep learning techniques, which have significantly advanced the state of the art yet face challenges in generalization and robustness. Finally, we identify key open problems and future directions, including the need for richer and more varied datasets, better generalization of learning-based models, effective formulations for symmetry detection in incomplete data, and the integration of symmetry priors in generative modeling. Our analysis highlights both the progress and the limitations of current methods and aims to guide future research toward more principled and capable symmetry-aware systems.

Ivan Sipiran

Department of Computer Science, University of Chile

7. From Volume Rendering to 3D Gaussian Splatting: Theory and Applications

September 30th, 2025
Time: 01:00 pm
Duration: 3h
Level: Advanced

Deep learning has transformed medical image analysis. However, building effective models for volumetric data, such as CT, MRI, and PET scans, presents a new set of challenges. These include high computational costs, limited annotated datasets, and the need for architectures that can process volumetric data directly. This survey provides a comprehensive overview of recent advances in deep neural network architectures specifically designed for 3D medical imaging. We analyze the progression from early 3D CNNs to hybrid and Transformer-based models, highlighting their structural innovations and applicability. In addition, we review training strategies such as supervised pretraining, self-supervised learning, and the development of general-purpose 3D foundational models. Also, a comprehensive overview of 3D medical imaging datasets and their associated tasks is presented in the supplementary materials, highlighting the diverse clinical objectives that deep learning models can support across healthcare and diagnostic applications. A dedicated section addresses the emerging field of explainability in volumetric contexts, emphasizing the limitations of adapting 2D explainability-focused tools and the importance of 3D-native explanation frameworks. We conclude by outlining the key trends in the field, including the growing availability of multiple and functionally diverse datasets; novel architectural designs tailored for volumetric learning; the transition from convolutional to attention-based models; increased reliance on transfer learning from both labeled and unlabeled data; and the emergence of 3D-native XAI methods that provide clinically meaningful and trustworthy explanations.

Vitor Matias

Universidade de São Paulo

Daniel Perazzo

IMPA

Vinícius da Silva

Tecgraf PUC-Rio

Alberto Raposo

Tecgraf PUC-Rio

Luiz Velho

IMPA

Afonso Paiva

ICMC-USP

Tiago Novello

IMPA

8. Strategies for Deep Learning in Volumetric Medical Imaging: A Survey

September 30th, 2025
Time: 09:00 am (morning session), 01:00 pm (afternoon session)
Duration: 6h
Level: Advanced

The problem of 3D reconstruction from posed images is undergoing a fundamental transformation, driven by continuous advances in 3D Gaussian Splatting (3DGS). By modeling scenes explicitly as collections of 3D Gaussians, 3DGS enables efficient rasterization through volumetric splatting, offering thus a seamless integration with common graphics pipelines. Despite its real-time rendering capabilities for novel view synthesis, 3DGS suffers from a high memory footprint, the tendency to bake lighting effects directly into its representation, and limited support for secondary-ray effects. This tutorial provides a concise yet comprehensive overview of the 3DGS pipeline, starting from its splatting formulation and then exploring the main efforts in addressing its limitations. Finally, we survey a range of applications that leverage 3DGS for surface reconstruction, avatar modeling, animation, and content generation — highlighting its efficient rendering and suitability for feed-forward pipelines.

João Vitor Silva de Oliveira

Universidade Federal de Viçosa

Danilo Vieira

Universidade Federal de Viçosa

Mateus Silva

Universidade Federal de Viçosa

Daniel Fernandes

Universidade Federal de Viçosa

Marcos Ribeiro

Universidade Federal de Viçosa

Hugo Oliveira

Universidade Federal de Viçosa
Rolar para cima