SaliencyI2PLoc/docs/Architectures.md at master · whu-lyh/SaliencyI2PLoc

SaliencyI2PLoc

Our SaliencyI2PLoc architecture is illustrated in

SaliencyI2PLoc encodes the input image-point cloud pairs into a high-dimensional feature embedding space using a feature encoder (ViT for images, mini-PointNet combined with Transformer for point clouds) and feature aggregator (saliency map boosted NetVLAD layer). It then achieves feature fusion and alignment through the contrastive learning loss function that incorporates cross-modal feature relationship consistency constraints.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SaliencyI2PLoc

FilesExpand file tree

Architectures.md

Latest commit

History

Architectures.md

File metadata and controls

SaliencyI2PLoc