This Repository contains a project for the exam of Deep Learning at University of Trento.
The project consists in 2 main tasks:
Given various images of different persons taken from different cameras (a version of Market-1501 Dataset), the model should try to predict a series of attributes associated with each image and produce similar embeddings for different images of the same person. The approach proposed in the code is composed by a Pretrained ResNet as a common backbone to extract the common features from the images in order to create the embeddings; as a next step, the embeddings are fed into 12 different Convolutional heads for the attribute recognition task (one for each attribute of our list) and an additional head for the prediction of the Person ID associated with the input image. The possible attributes of an image are:
Attributes | Possible values |
---|---|
Age | Young, Teenager, Adult, Old |
Backpack | Yes, No |
Bag | Yes, No |
Hand Bag | Yes, No |
Clothes | Dress, Pants |
Down | Long, Short |
Up | Long, Short |
Hair | Short, Long |
Hat | Yes, No |
Gender | Male, Female |
Up Color | Black, White, Red, Purple, Yellow, Gray, Blue, Green, Multicolor |
Down Color | Black, White, Pink, Purple, Yellow, Gray, Blue, Green, Brown, Multicolor |
The code is stored in the Jupiter file; every additional information regarding the code itself, can be seen directly in the code or in the report above.
Alessio Dellasega: https://github.com/AleDella
Vincenzo Marco De Luca: https://github.com/vmdl
Matteo Zanella: https://github.com/MatteoZanella