Deep metrics learning summary

1 minute read

1. Summary

Metrics learning is a learning method that aims at learning how similar or related two input objects are. It has applications in ranking, in recommendation systems, visual identity tracking, face verification, and speaker verification. Each object can be considered as a feature vector inside embedding space. By learning, the distance between similar objects’ feature vector will become closer. In opposite, the distance between dissimilar objects’ feature vector will become further.

The similarity value (how similar) of input objects are calculated as follows. At first, feature vectors for each object (image, sound, text, etc) are extracted. Then, the distance (Euclid, Coisine, Manhattance, etc) between features vectors are calculated. This distance is the similarity value for 2 input objects

By using Deep Neural Network to construct a non-linear feature extractor for metrics learning, we will have “Deep Metrics Learning” (DML) methods.

Summary 1

After learning, the face image of same people will become closer while the face image of different people will become further.

Summary 2

2. Usage

DML can be used for the following task

image searching
verification
identification
clustering
Anomaly detection
few-shot-learning

2.1 Image searching

Image searching = find the similar image with query image from image database.

Summary 2

The procedure is quite simple.

Extract feature vector for all image inside database
Extract feature vector for query image
Calculate the distance

2.2 Verification

Verification = checking whether registered image and query image matches

Verification

Procedure

Extract feature vector for registered image and query image
Calculate distance between 2 vectors
Compare with threshold value

2.3 Identification

Identification = who is this people ?

Identification

2.4 Clustering

DML become the feature extractor

2.5 Anomany detection

Same as verification

2.6 few-shot learning

zero-shot object detector = train object detector to detect object never seen before

zero-shot object detector

one-shot object detector = template matching

one-shot object detector

3. DML methods

Siamese network and Triplet network are standard learning methods for DML.

See https://omoindrot.github.io/triplet-loss for more detail about triplet and siamese net

For newer methods, see https://github.com/ifeherva/DMLPlayground

TODO: more detail here

Reference

Share on

Twitter Facebook LinkedIn

Deep metrics learning summary

1. Summary

2. Usage

2.1 Image searching

2.2 Verification

2.3 Identification

2.4 Clustering

2.5 Anomany detection

2.6 few-shot learning

3. DML methods

Reference

Share on

Leave a comment

You may also enjoy

Some quick notes about invox (old name = klavis)

summary of news from image-sensors-world for July 2020

Quick note about TVM

Re-produce inferecing speed of Yolov4 using acceleration framework