DEFESA DE DISSERTAÇÃO DE MESTRADO
Aluno: Matheus Vinícius Todescato
Orientador: Prof. Dr. Joel Luís Carbonera
Título: An image classification approach based on graph convolutional networks and patch-based multiscale feature graphs
Linha de Pesquisa: Aprendizado de Máquina, Representação de Conhecimento e Raciocínio
Local: Esta banca ocorrerá de forma remota. Acesso público disponibilizado pelo link: https://mconf.ufrgs.br/webconf/00179534 .
– Prof. Dr. Claudio Rosito Jung (UFRGS)
– Prof. Dr. Thiago Lopes Trugillo da Silveira (UFRGS)
– Prof. Dr. Maciel Zortea (IBM Research)
Presidente da Banca: Prof. Dr. Joel Luís Carbonera
Abstract: Deep learning architectures have demonstrated impressive results in image classification in the last few years. However, applying sophisticated neural network architectures in small datasets remains challenging. In this context, transfer learning is a promising approach for dealing with this scenario. Generally, the available pre-trained architectures adopt a standard fixed input, which usually implies resizing and cropping the input images in the preprocessing phase, causing information loss. Besides, images present visual features in different scales in real-world scenarios, and most common approaches do not consider this fact. In this work, we propose an approach that applies transfer learning for dealing with small datasets and leverages visual features extracted by pre-trained models from different scales. We based our approach on graph convolutional networks (GCN) that take graphs representing the images in different scales as input and whose nodes are characterized by features extracted by pre-trained models from regular image patches of different scales. Since GCN can deal with graphs with different numbers of nodes, our approach can deal naturally with images of heterogeneous sizes without discarding relevant information. We evaluated our approach in two datasets: a set of geological images and a publicly available dataset, presenting characteristics that challenge traditional approaches. We tested our approach by adopting three different pre-trained models as feature extractors: two efficient pre-trained CNN models (DenseNet and ResNeXt) and one Vision Transformer (CLIP). We compared our approach with two conventional approaches for dealing with image classification. The experiments show that our approach achieves better results than the conventional approaches for this task.
Keywords: Image Classification. Graph Convolutional Network. Transfer Learning. Feature Extraction. Multiscale.