A journal of IEEE and CAA , publishes high-quality papers in English on original theoretical/experimental research and development in all areas of automation
Volume 10 Issue 9
Sep.  2023

IEEE/CAA Journal of Automatica Sinica

  • JCR Impact Factor: 11.8, Top 4% (SCI Q1)
    CiteScore: 17.6, Top 3% (Q1)
    Google Scholar h5-index: 77, TOP 5
Turn off MathJax
Article Contents
Y. Y. Li, C. Q. Fei, C. Q. Wang, H. M. Shan, and R. Q. Lu, “Geometry flow-based deep riemannian metric learning,” IEEE/CAA J. Autom. Sinica, vol. 10, no. 9, pp. 1882–1892, Sept. 2023. doi: 10.1109/JAS.2023.123399
Citation: Y. Y. Li, C. Q. Fei, C. Q. Wang, H. M. Shan, and R. Q. Lu, “Geometry flow-based deep riemannian metric learning,” IEEE/CAA J. Autom. Sinica, vol. 10, no. 9, pp. 1882–1892, Sept. 2023. doi: 10.1109/JAS.2023.123399

Geometry Flow-Based Deep Riemannian Metric Learning

doi: 10.1109/JAS.2023.123399
Funds:  This work was supported in part by the Young Elite Scientists Sponsorship Program by CAST (2022QNRC001), the National Natural Science Foundation of China (61621003, 62101136), Natural Science Foundation of Shanghai (21ZR1403600), Shanghai Municipal Science and Technology Major Project (2018SHZDZX01) and ZJLab, and Shanghai Municipal of Science and Technology Project (20JC1419500)
More Information
  • Deep metric learning (DML) has achieved great results on visual understanding tasks by seamlessly integrating conventional metric learning with deep neural networks. Existing deep metric learning methods focus on designing pair-based distance loss to decrease intra-class distance while increasing inter-class distance. However, these methods fail to preserve the geometric structure of data in the embedding space, which leads to the spatial structure shift across mini-batches and may slow down the convergence of embedding learning. To alleviate these issues, by assuming that the input data is embedded in a lower-dimensional sub-manifold, we propose a novel deep Riemannian metric learning (DRML) framework that exploits the non-Euclidean geometric structural information. Considering that the curvature information of data measures how much the Riemannian (non-Euclidean) metric deviates from the Euclidean metric, we leverage geometry flow, which is called a geometric evolution equation, to characterize the relation between the Riemannian metric and its curvature. Our DRML not only regularizes the local neighborhoods connection of the embeddings at the hidden layer but also adapts the embeddings to preserve the geometric structure of the data. On several benchmark datasets, the proposed DRML outperforms all existing methods and these results demonstrate its effectiveness.

     

  • loading
  • [1]
    J. B. Tenenbaum, V. De Silva, and J. C. Langford, “A global geometric framework for nonlinear dimensionality reduction,” Science, vol. 290, no. 5500, pp. 2319–2323, 2000. doi: 10.1126/science.290.5500.2319
    [2]
    B. Lin, X. He, and J. Ye, “A geometric viewpoint of manifold learning,” Applied Informatics, 2015.
    [3]
    E. Xing, M. Jordan, S. J. Russell, and A. Ng, “Distance metric learning with application to clustering with side information,” Advances in Neural Information Processing Systems, vol. 15, pp. 521–528, 2002.
    [4]
    K. Q. Weinberger and L. K. Saul, “Distance metric learning for large margin nearest neighbor classification,” Journal of Machine Learning Research, vol. 10, p. 2, 2009.
    [5]
    J. V. Davis, B. Kulis, P. Jain, S. Sra, and I. S. Dhillon,”Information-theoretic metric learning,” in Proc. 24th Int. Conf. Machine Learning, pp. 209–216, 2007.
    [6]
    F. Schroff, D. Kalenichenko, and J. Philbin, “FaceNet: A unified embedding for face recognition and clustering,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 815–823, 2015.
    [7]
    Y. Jin, Y. Dong, Y. Zhang, and X. Hu, “SSMD: Dimensionality reduction and classification of hyperspectral images based on spatial-spectral manifold distance metric learning,” IEEE Trans. Geoscience and Remote Sensing, vol. 60, p. 5538916, 2022.
    [8]
    Y. Wen, K. Zhang, Z. Li, and Y. Qiao, “A discriminative feature learning approach for deep face recognition,” in Proc. European Conf. Computer Vision, pp. 499–515, Springer, 2016.
    [9]
    X. He, Y. Zhou, Z. Zhou, S. Bai, and X. Bai, “Triplet-center loss for multi-view 3D object retrieval,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 1945–1954, 2018.
    [10]
    H. Wang, Y. Wang, Z. Zhou, X. Ji, D. Gong, J. Zhou, Z. Li, and W. Liu, “Cosface: Large margin cosine loss for deep face recognition,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 5265–5274, 2018.
    [11]
    M. M. Bronstein, J. Bruna, Y. LeCun, A. Szlam, and Vandergheynst, “Geometric deep learning: Going beyond euclidean data,” IEEE Signal Processing Magazine, vol. 34, no. 4, pp. 18–42, 2017. doi: 10.1109/MSP.2017.2693418
    [12]
    J. H. White and R. W. Beard, “An iterative pose estimation algorithm based on epipolar geometry with application to multi-target tracking,” IEEE/CAA J. Autom. Sinica, vol. 7, no. 4, pp. 942–953, 2020. doi: 10.1109/JAS.2020.1003222
    [13]
    D. Ye, B. Desjardins, J. Hamm, H. Litt, and K. M. Pohl, “Regional manifold learning for disease classification,” IEEE Trans. Medical Imaging, vol. 33, no. 6, pp. 1236–1247, 2014. doi: 10.1109/TMI.2014.2305751
    [14]
    X. Chen, J. Weng, W. Lu, J. Xu, and J. Weng, “Deep manifold learning combined with convolutional neural networks for action recognition,” IEEE Trans. Neural Networks and Learning Systems, vol. 29, no. 9, pp. 3938–3952, 2017.
    [15]
    D. Wu and X. Luo, “Robust latent factor analysis for precise representation of high-dimensional and sparse data,” IEEE/CAA J. Autom. Sinica, vol. 8, no. 4, pp. 796–805, 2020.
    [16]
    D. Li, W. Hung, J. Huang, S. Wang, N. Ahuja, and M. Yang, “Unsupervised visual representation learning by graph-based consistent constraints,” in Proc. European Conf. Computer Vision, pp. 678–694, 2016.
    [17]
    Y. Li and R. Lu, “Riemannian metric learning based on curvature flow,” in Proc. 24th Int. Conf. Pattern Recognition, pp. 806–811, 2018.
    [18]
    G. Taubin, “A signal processing approach to fair surface design,” in Proc. 22nd Annu. Conf. Computer Graphics and Interactive Techniques, pp. 351–358, 1995.
    [19]
    K. Sohn, “Improved deep metric learning with multi-class n-pair loss objective,” in Advances in Neural Information Processing Systems, pp. 1857–1865, 2016.
    [20]
    W. Ge, “Deep metric learning with hierarchical triplet loss,” in Proc. European Conf. Computer Vision, pp. 269–285, 2018.
    [21]
    X. Wang, X. Han, W. Huang, D. Dong, and M. R. Scott, “Multi-similarity loss with general pair weighting for deep metric learning,” in Proc. IEEE/CVF Conf. Computer Vision and Pattern Recognition, pp. 5022–5030, 2019.
    [22]
    E. Ustinova and V. Lempitsky, “Learning deep embeddings with histogram loss,” arXiv preprint arXiv: 1611.00822, 2016.
    [23]
    J. Wang, F. Zhou, S. Wen, X. Liu, and Y. Lin, “Deep metric learning with angular loss,” in Proc. IEEE Int. Conf. Computer Vision, pp. 2593–2601, 2017.
    [24]
    D. Zhang, Y. Li, and Z. Zhang, “Deep metric learning with spherical embedding,” Advances in Neural Information Processing Systems, vol. 33, 2020.
    [25]
    W. Zhao, Y. Rao, Z. Wang, J. Lu, and J. Zhou, “Towards interpretable deep metric learning with structural matching,” in Proc. IEEE/CVF Int. Conf. Computer Vision, pp. 9887–9896, 2021.
    [26]
    B. Chen, P. Li, Z. Yan, B. Wang, and L. Zhang, “Deep metric learning with graph consistency,” in Proc. 35th AAAI Conf. Artificial Intelligence, pp. 982–990, 2021.
    [27]
    J. Seidenschwarz, I. Elezi, and L. Leal-Taixe, “Learning intra-batch connections for deep metric learning,” in Proc. 38th Int. Conf. Machine Learning, 2021.
    [28]
    Y. Ollivier, “Ricci curvature of markov chains on metric spaces,” Journal of Functional Analysis, vol. 256, no. 3, pp. 810–864, 2009. doi: 10.1016/j.jfa.2008.11.001
    [29]
    K. I. Kim, J. Tompkin, and C. Theobalt, “Curvature-aware regularization on riemannian submanifolds,” in Proc. IEEE Int. Conf. Computer Vision, pp. 881–888, 2013.
    [30]
    D. Gong, X. Zhao, and G. Medioni, “Robust multiple manifolds structure learning,” arXiv preprint arXiv: 1206.4624, 2012.
    [31]
    Y. Li, “Curvature-aware manifold learning,” Pattern Recognition, vol. 83, pp. 273–286, 2018. doi: 10.1016/j.patcog.2018.06.007
    [32]
    W. Xu, E. R. Hancock, and R. C. Wilson, “Ricci flow embedding for rectifying non-euclidean dissimilarity data,” Pattern Recognition, vol. 47, no. 11, pp. 3709–3725, 2014. doi: 10.1016/j.patcog.2014.04.021
    [33]
    H. Li, J. Cao, J. Zhu, Y. Liu, Q. Zhu, and G. Wu, “Curvature graph neural network,” Information Sciences, vol. 592, pp. 50–66, 2022. doi: 10.1016/j.ins.2021.12.077
    [34]
    Z. Ye, K. S. Liu, T. Ma, J. Gao, and C. Chen, “Curvature graph network,” in Proc. Int. Conf. Learning Representations, 2019.
    [35]
    C. Ni, Y. Lin, J. Gao, and X. Gu, “Network alignment by discrete ollivier-Ricci flow,” in Proc. Int. Symp. Graph Drawing and Network Visualization, pp. 447–462, Springer, 2018.
    [36]
    C. Ni, Y. Lin, F. Luo, and J. Gao, “Community detection on networks with Ricci flow,” Scientific Reports, vol. 9, no. 1, pp. 1–12, 2019. doi: 10.1038/s41598-018-37186-2
    [37]
    N. G. J. David, T. Guarrera, and H. F. Wolfe, “The tayer expansion of a riemannian metric,” [online] Available: https://studylib.net/doc/13872779/the-taylor-expansion-of-a-riemannian-metric, 2002.
    [38]
    B. Chow and D. Knopf, “The Ricci flow: An introduction,” Mathematical Surveys and Monographs, vol. 110, 2008.
    [39]
    R. S. Hamilton, “Three-manifolds with positive Ricci curvature,” Journal of Differential Geometry, vol. 17, no. 2, pp. 255–306, 1982.
    [40]
    S. Brendle and R. Schoen, “Curvature, Sphere theorems, and the Ricci flow,” arXiv preprint arXiv: 1001.2278v2, 2010.
    [41]
    Y. Li and R. Lu, “Applying Ricci flow to high dimensional manifold learning,” Science China Information Sciences, vol. 62, no. 9, pp. 192101:1–192101:14, 2019.
    [42]
    S. Cost and S. Salzberg, “A weighted nearest neighbor algorithm for learning with symbolic features,” Machine Learning, vol. 10, pp. 57–78, 1993.
    [43]
    C. Wah, S. Branson, P. Welinder, P. Perona, and S. Belongie, “The caltech-ucsd birds-200-2011 dataset,” 2011.
    [44]
    J. Krause, M. Stark, J. Deng, and F. Li, “3D object representations for fine-grained categorization,” in Proc. IEEE Int. Conf. Computer Vision Workshops, pp. 554–561, 2013.
    [45]
    H. Oh Song, Y. Xiang, S. Jegelka, S. Savarese, “Deep metric learning via lifted structured feature embedding,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 4004–4012, 2016.
    [46]
    S. Ioffe and C. Szegedy, “Batch normalization: Accelerating deep network training by reducing internal covariate shift,” in Proc. Int. Conf. Machine Learning, pp. 448–456, 2015.
    [47]
    J. Deng, W. Dong, R. Socher, L. Li, K. Li, and F. Li, “ImageNet: A large-scale hierarchical image database,” in Proc. IEEE Conf. Computer Vision and Pattern Recognition, pp. 248–255, 2009.

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Figures(8)  / Tables(3)

    Article Metrics

    Article views (268) PDF downloads(102) Cited by()

    Highlights

    • A new deep Riemannian metric learning (DRML) is proposed via the geometry flow with adding the geometric structure as a regularization term to the hidden layer
    • DRML firstly uses the curvature information of feature distributions for the regularization of the deep metric learning
    • The effectiveness of DRML has been evaluated on the convergence of the embedding learning as well as the performance of the image clustering

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return