This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Principal Curvatures Estimation with Applications to Single Cell Data

Yanlei Zhang1, 2, , Lydia Mezrag1, 2, , Xingzhi Sun3, Charles Xu3, 4, Kincaid Macdonald3, Dhananjay Bhaskar3,4,
Smita Krishnaswamy2, 3, 4, 5, {\dagger}, Guy Wolf1, 2, 7, {\dagger} and Bastian Rieck6, 7, {\dagger}
This research was partially funded by Mitacs Globalink Research Award IT40964 [L.M.]; Yale – Boehringer Ingelheim Biomedical Data Science Fellowship [D.B.]; Hightech Agenda Bavaria, Swiss State Secretariat for Education, Research and Innovation [B.R.]; Humboldt Research Fellowship, CIFAR AI Chair, NSERC Discovery grant 03267, FRQNT grant 343567 [G.W.]; CRM-Simons visiting professor award, NSF career grant 2047856 [S.K.]; and NSF grant DMS-2327211 [G.W., S.K.]. The content provided here is solely the responsibility of the authors and does not necessarily represent the official views of the funding agencies. Correspondence to [email protected] and [email protected] 1Université de Montréal, Dept. of Math. & Stat.; 2Mila – Quebec AI Institute, Montréal, QC, CA 3Yale University, Dept. of Comp. Sci.; 4Dept. of Genetics; 5Applied Mathematics Program, New Haven, CT, USA 6Université de Fribourg, Department of Informatics, Fribourg, FR, CH 7Helmholtz Zentrum München, Institute of AI for Health, Munich, BY, DE *Equal contribution; {\dagger}Co-senior authors.
Abstract

The rapidly growing field of single-cell transcriptomic sequencing (scRNAseq) presents challenges for data analysis due to its massive datasets. A common method in manifold learning consists in hypothesizing that datasets lie on a lower dimensional manifold. This allows to study the geometry of point clouds by extracting meaningful descriptors like curvature. In this work, we will present Adaptive Local PCA (AdaL-PCA), a data-driven method for accurately estimating various notions of intrinsic curvature on data manifolds, in particular principal curvatures for surfaces. The model relies on local PCA to estimate the tangent spaces. The evaluation of AdaL-PCA on sampled surfaces shows state-of-the-art results. Combined with a PHATE embedding, the model applied to single-cell RNA sequencing data allows us to identify key variations in the cellular differentiation.

Index Terms:
Principal curvature, Gaussian curvature, single-cell, principal directions.

I Introduction

The estimation of principal curvatures and principal directions is crucial in uncovering directional changes within data manifolds. Indeed, the mean and Gaussian curvatures of surfaces have been studied for several decades in computer graphics and some related areas (e.g., [1, 2, 3, 4, 5]). Recent methods have proposed to estimate curvature over data manifolds derived from point-cloud data via manifold learning techniques (e.g., [6, 7, 8]). However, achieving precision is challenging given the variations in data density and the necessity for high-quality samplings. To address this, various methods have been developed. Volume-based approaches like diffusion curvature by [9] and [10] heavily depend on accurate distance estimations. Laplace–Beltrami operator-based approaches, as explored in [11] and [9] encounter limitations in accurately estimating curvature from small sample sizes. Second Fundamental Form-based approaches, as proposed in [11], demonstrate relatively high-quality curvature estimation for scalar curvature. However, they rely on fixed parameters for neighborhood selection. We introduce adaptability into the estimation process, addressing the challenges associated with variable data density and the absence of intrinsic curvature information by dynamically adjusting parameters based on the local properties of the manifold. This ensures robustness across diverse manifolds. Our main contributions are:

  • We estimate the point-wise Gaussian curvature of point clouds and their underlying principal curvatures, i.e. How much the data curves and in which directions it curves the most.

  • We dynamically adjust neighborhood scales for local PCA and curvature estimation based on the explained variance ratio. This ensures accurate predictions without requiring hand-tuning of the parameters.

  • We demonstrate the fidelity of our method relative to ground truth (Gaussian and mean) curvatures on canonical 2-manifolds.

  • We illustrate its application to single-cell data analysis, where principal curvatures suggest the directions of cell differentiation.

II Methods

For differential geometry preliminaries, we refer the reader to [12], as an extensive introduction to this topic would be beyond the scope of this work and its succint presentation.

II-A Local PCA

Our method starts with Local PCA as described in [13]. Given a point cloud x1,,xmx_{1},\cdots,x_{m}, we select a neighborhood 𝒩xi,ϵPCA:={xj:0<xjxi<ϵPCA}\mathcal{N}_{x_{i},\epsilon_{\text{PCA}}}:=\{x_{j}:0<\|x_{j}-x_{i}\|<\epsilon_{\text{PCA}}\} around each point xix_{i} for a hyperparameter ϵPCA>0\epsilon_{\text{PCA}}>0 that has to be determined. Each data matrix containing the neighbors of a point xix_{i} is shifted to be centered around xix_{i} to get a matrix Xi=[xi1xi,,xiNixi]X_{i}=\left[x_{i_{1}}-x_{i},\ldots,x_{i_{N_{i}}}-x_{i}\right] where Ni:=|𝒩xi,ϵPCA|N_{i}:=|\mathcal{N}_{x_{i},\epsilon_{\text{PCA}}}|. Then, the columns of XiX_{i} are rescaled to Bi=XiDiB_{i}=X_{i}D_{i} by applying a diagonal weighting matrix DiD_{i} to emphasize the importance of local data. Finally, SVD decomposition yields a numerical approximation of the tangent plane. Since, we are interested mainly in surfaces, the first two eigenvectors are selected as a basis for the local tangent space and the third one serves as a normal vector to the surface.

II-B Adaptive Local PCA and parameter selection

AdaL-PCA uses the explained variance ratio for the first two singular values given by

ρ(r):=i=12σi(r)2i=13σi(r)2\rho(r):=\frac{\sum_{i=1}^{2}\sigma_{i}(r)^{2}}{\sum_{i=1}^{3}\sigma_{i}(r)^{2}} (1)

to select a suitable parameter ϵPCA\epsilon_{\text{PCA}}. This ratio describes the fraction of data variance captured by the tangent plane approximated by the span of the first two singular vectors. We set a threshold γ\gamma for the ratio ρ(r)\rho(r) and compute the largest rr-neighborhood that explains a fraction γ\gamma of the data variance. That is,

ϵPCA:=max{r|ρ(r)>γ}.\epsilon_{\text{PCA}}:=\max\left\{r\ \big{|}\rho(r)>\gamma\right\}. (2)
Algorithm 1 Adaptive Local PCA (AdaL-PCA)
  Input: Point cloud data x1,,xm3x_{1},\ldots,x_{m}\in\mathbb{R}^{3}, query point pp, kernel function KK with supports in [0,1], data bound δ\delta (maximum pairwise distance in data), ratio bound ρ0(0,1)\rho_{0}\in(0,1) for choosing size of PCA neighborhood.
  for r(0,0.2δ]r\in(0,0.2\delta] do
     (𝒩p,r,𝐃𝐫){(q,qp):0<qp<r}(\mathcal{N}_{p,r},\mathbf{D_{r}})\leftarrow\{(q,\|q-p\|):0<\|q-p\|<r\}
     𝐗𝒩p,rp\mathbf{X}\leftarrow\mathcal{N}_{p,r}-p
     𝐃diag(K(𝐃𝐫/r))\mathbf{D}\leftarrow\text{diag}(\sqrt{K(\mathbf{D_{r}}/r)})
     𝐁𝐃𝐗\mathbf{B}\leftarrow\mathbf{D}\mathbf{X}
     𝐔𝚺𝐕TSVD(𝐁)\mathbf{U}\mathbf{\Sigma}\mathbf{V}^{T}\leftarrow\text{SVD}(\mathbf{B})
     ρ(r){i=12σi2/i=13σi2:σi𝚺}\rho(r)\leftarrow\left\{\sum_{i=1}^{2}\sigma_{i}^{2}\big{/}\sum_{i=1}^{3}\sigma_{i}^{2}:\sigma_{i}\in\mathbf{\Sigma}\right\}
  end for
  ϵPCAmax({r:ρ(r)>ρ0})\epsilon_{\text{PCA}}\leftarrow\max(\{r:\rho(r)>\rho_{0}\})
  τargminr{ρ(r)}\tau\leftarrow\text{argmin}_{r}\{\rho(r)\}
  Return: ϵPCA\epsilon_{\text{PCA}}, τ\tau

We use a similar method to select a radius τi\tau_{i} for estimating the curvature around each data point xix_{i}. In this case, we need a neighborhood large enough to capture the “bending” of the surface. This is done by computing the lowest value reached by the graph of the explained variance ratio ρ(r)\rho(r),

τ:=argminr{ρ(r)}.\tau:=\arg\min_{r}\left\{\rho(r)\right\}. (3)
Refer to caption
Figure 1: Comparison of the explained variance ratio of the top two singular values and accuracy (RMSE) of Gaussian curvature estimation w.r.t. increasing radii of ϵ\epsilon-neighborhood and τ\tau-neighborhood around pp on torus.

Refer to caption

Figure 2: Directional curvatures in an ϵ\epsilon-PCA neighborhood of pp.

As illustrated in Fig.1 at a point pp on a torus, this approach is motivated by the fact that as the τ\tau-neighborhood increases past a certain threshold, the variance in data can no longer be explained by the selected tangent plane. We refer the reader to Algorithm 1 for a summary of AdaL-PCA’s key steps and emphasize that both ϵPCA\epsilon_{\text{PCA}} and τ\tau are adjusted at each data point to capture the local geometry.

II-C Curvature Estimation

Algorithm 2 Estimation for Principal Curvature, Gaussian Curvature, and Mean Curvature
  Input: Point cloud data x1,,xm3x_{1},\ldots,x_{m}\in\mathbb{R}^{3}, query point pp, kernel function KK with supports in [0,1], the pair (ϵPCA,τ)(\epsilon_{\text{PCA}},\tau), the percentage p(0,1)p\in(0,1) of total number of points for which the largest (smallest) directional curvature κ1\kappa_{1} (κ2\kappa_{2}) is computed.
  (𝒩p,ϵPCA,𝐃ϵPCA){(q,qp):0<qp<ϵPCA}(\mathcal{N}_{p,\epsilon_{\text{PCA}}},\mathbf{D_{\epsilon_{\text{PCA}}}})\leftarrow\{(q,\|q-p\|):0<\|q-p\|<\epsilon_{\text{PCA}}\}
  𝐗𝒩p,ϵPCAp\mathbf{X}\leftarrow\mathcal{N}_{p,\epsilon_{\text{PCA}}}-p
  𝐃diag(K(𝐃ϵPCA/ϵPCA))\mathbf{D}\leftarrow\text{diag}(\sqrt{K(\mathbf{D_{\epsilon_{\text{PCA}}}}/\epsilon_{\text{PCA}})})
  𝐁𝐃𝐗\mathbf{B}\leftarrow\mathbf{D}\mathbf{X}
  𝐔𝚺𝐕TSVD(𝐁)\mathbf{U}\mathbf{\Sigma}\mathbf{V}^{T}\leftarrow\text{SVD}(\mathbf{B})
  𝐎𝐔[:3,:]\mathbf{O}\leftarrow\mathbf{U}[\ :3,:\ ]
  𝒩p,τ{q:0<qp<τ}\mathcal{N}_{p,\tau}\leftarrow\{q:0<\|q-p\|<\tau\}
  for q𝒩p,τq\in\mathcal{N}_{p,\tau} do
     vqqpv_{q}\leftarrow q-p
     κq2(𝐎[2]vq)/vq2\kappa_{q}\leftarrow 2(\mathbf{O}[2]\cdot v_{q})/||v_{q}||^{2} {by equation 4}
     wqK(vq/τ)w_{q}\leftarrow K(v_{q}/\tau)
  end for
  Csort{(κq,wq):κq in ascending order}\text{C}\leftarrow\text{sort}\{(\kappa_{q},w_{q}):\kappa_{q}\text{ in ascending order}\}
  kint(plen(C))k\leftarrow\text{int}(p\cdot\text{len}(\text{C}))
  κ1sum({κqwq:(κq,wq)C[:k]})/sum({wq})\kappa_{1}\leftarrow\text{sum}(\{\kappa_{q}\cdot w_{q}:(\kappa_{q},w_{q})\in\text{C}[:k]\})/\text{sum}(\{w_{q}\})
  κ2sum({κqwq:(κq,wq)C[k:]})/sum({wq})\kappa_{2}\leftarrow\text{sum}(\{\kappa_{q}\cdot w_{q}:(\kappa_{q},w_{q})\in\text{C}[-k:]\})/\text{sum}(\{w_{q}\})
  Kpκ1κ2K_{p}\leftarrow\kappa_{1}\cdot\kappa_{2}
  Hpκ1+κ2H_{p}\leftarrow\kappa_{1}+\kappa_{2}
  Return: κ1\kappa_{1}, κ2\kappa_{2}, KpK_{p}, HpH_{p}

The directional curvatures κi(T)\kappa_{i}(T) at a point xix_{i} in a direction TT are approximated (see for instance [14]) by

κi(T)2N.TT2+O(t).\kappa_{i}(T)\approx\frac{2N.T}{\lVert T\rVert^{2}}+O(t). (4)

Here TT is replaced by the entries of XiX_{i} in the proper τi\tau_{i}-neighborhood and NN is the orthonormal vector to the frame obtained by local PCA. The principal curvatures κ1\kappa_{1} and κ2\kappa_{2} correspond, respectively, to the highest and lowest values of the directional curvatures.111Note that this is sometimes taken as a definition of principal curvatures. In practice, we select a percentage (20%) of the highest (respectively, lowest) curvatures and average them (using a Gaussian kernel) to approximate κ1\kappa_{1} (respectively, κ2\kappa_{2}). By selecting the directional vectors TT corresponding to the highest curvatures κT\kappa_{T}, this averaging yields principal directions, while we obtain Gaussian curvature by the product κ1κ2\kappa_{1}\kappa_{2}. The time complexity of our current implementation is O(nτm(m2+log nτ))O(n_{\tau}m(m^{2}+\text{log }n_{\tau})), where nτn_{\tau} is an upper bound on the cardinality of the τ\tau-neighborhoods (in general nτmn_{\tau}\ll m). This can be improved significantly in practice with fast PCA algorithms for scalability [15], [16].

III Results and discussion

Our main contribution is the estimation of the principal curvatures and principal directions. We mainly focus on the application of the principal curvatures and principal directions to biological data and identify key properties and changes in the geometry of these datasets.

We validate the accuracy of our principal curvature estimation by computing Gaussian curvature on toy datasets. Moreover, we apply our estimation of principal curvature and Gaussian curvature for single-cell RNA sequencing data (scRNA-seq).222Implementation details and some examples can be found at https://github.com/LydiaMez/AdaL-PCA.git. Gaussian curvature gives the “intensity” for the differentiation of cell states, and principal directions give the directions for the split of the cell lineages.

III-A Estimation on Sampled Surfaces

We compare AdaL-PCA’s estimates of Gaussian curvature against two contemporary methods, Hickok & Blumberg [10] and Diffusion Curvature [9]. We also quantify AdaL-PCA’s recovery of ground-truth mean curvature as a validation of the fidelity of its principal curvatures.

Refer to caption
Figure 3: Comparison of AdaL-PCA against ground truth for mean curvature on three toy datasets. Corr stands for Pearson correlation and RMSE stands for the root means squared error.

To assess the ability of various models to recover the Gaussian and mean curvatures, we generate datasets from three canonical 2-dimensional manifolds: the torus, ellipsoid, and the hyperbolic paraboloid (saddle). Tables I and II were generated from 5000 points sampled uniformly from these surfaces. To study the robustness of each method to noise, we corrupt each point:

xi~=xi+ϵi\tilde{x_{i}}=x_{i}+\epsilon_{i}

where each ϵ1,ϵNiid𝒩(0,σ)\epsilon_{1},\ldots\epsilon_{N}\overset{\mathrm{iid}}{\sim}\mathcal{N}(0,\sigma) and σ\sigma between 0.0 and 0.5.

TABLE I: Root Mean Square Error (RMSE) and Energy Distance (Eng. Dist) of Gaussian curvature estimation for different noise levels. For RMSE and Eng. Dist, smaller is better.
data noise Ours H. & B.
RMSE Eng.Dist RMSE Eng.Dist
0.0 0.462 0.462 1.302 0.646
0.1 1.391 0.725 7.489 3.076
Torus 0.2 2.023 1.026 15.914 4.772
0.3 2.056 1.071 19.143 5.171
0.4 2.060 1.076 19.971 5.168
0.5 2.048 1.059 19.944 5.041
0.0 0.430 0.251 0.388 0.361
0.1 0.849 0.277 6.730 3.407
Ellipsoid 0.2 0.564 0.832 15.647 5.075
0.3 1.760 1.296 20.135 5.576
0.4 1.988 1.565 21.007 5.541
0.5 2.061 1.643 20.852 5.391
0.0 0.293 0.321 2.025 1.154
0.1 0.400 0.405 4.032 1.981
Hyperbolic 0.2 0.567 0.538 10.077 3.588
paraboloid 0.3 0.673 0.674 12.829 4.065
0.4 0.753 0.757 13.532 4.069
0.5 0.776 0.787 13.230 3.908

Note that both Hickok & Blumberg’s method and Diffusion Curvature require manually specified parameters, which must be tuned for each dataset. By contrast, AdaL-PCA’s heuristics adapt the method to each dataset. This results in an improved performance observed in Fig. I and Fig. II. Diffusion Curvature is an unsigned measure of local curvature for point clouds sampled from a manifold. Although it differs from Gaussian curvature, numerical experiments detailed in [9] suggest a correlation. Therefore, we report only the Pearson correlation for diffusion curvature.

TABLE II: Pearson Correlation Coefficient (Pearson Corr.) of Gaussian curvature estimation for different noise levels.
data noise Ours Diffusion Curvature
0.0 0.996 0.445
0.1 0.865 0.270
Torus 0.2 0.633 0.304
0.3 0.550 0.308
0.4 0.440 0.273
0.5 0.408 0.243
0.0 0.988 0.149
0.1 0.325 0.057
Ellipsoid 0.2 0.124 0.002
0.3 -0.153 0.017
0.4 -0.131 0.044
0.5 -0.018 0.048
0.0 0.747 0.398
0.1 0.603 0.333
Hyperbolic 0.2 0.481 0.282
paraboloid 0.3 0.428 0.336
0.4 0.386 0.342
0.5 0.363 0.327

III-B Curvature estimation for single-cell data

Refer to caption
Figure 4: Gaussian curvature and principal directions of embryonic stem cell differentiation. (A) PHATE visualization of scRNA-seq data color-coded by time intervals. (B) PHATE plot colored by Gaussian curvature values. (C, D) Principal directions at different stages of development of cells.
Refer to caption
Figure 5: Gaussian curvature and principal directions on IPSC dataset using AdaL-PCA.

We apply our model to single-cell data for cell state differentiation direction discovery. We use RNA sequencing data for human embryonic stem cells available at [17], collected over 27 days during which cells start as embryonic stem cells and then progressively differentiate into different cellular lineages. Low-dimensional manifold visualization of this data using PHATE (Fig. 4, A) shows that embryonic cells (days 0-3, displayed in red) branch into two lineages: endoderm (upper split) and ectoderm (lower split) around day 6. Further differentiation occurs during days 12-27. This is reflected in Fig. 4B with relatively constant zero curvature values at days 0-3 and a transition into a region of high variations in curvature. We observe starting from day 3 a transition into very low negative values of curvature and then a rapid progression into higher values close to zero as we approach day 27. This is consistent with the fact that the region 0-3 days corresponds to the stem state and the region 12-27 to the differentiated state. In addition to the signed curvature that provides a better appreciation of the cellular differentiation into several lineages (cell types), the principal directions in Fig. 4C, D obtained from projecting the three-dimensional principal directions using the PHATE embedding allow us to track the state towards which the cells differentiate, adding directional information.

We estimated the curvature of a publicly available single-cell induced pluripotent stem cell (iPSC) reprogramming. In this dataset, mass cytometry is used to quantitatively measure 33 protein biomarkers in 2005 mouse fibroblast cells induced to undergo reprogramming into stem cell state. Low-dimensional PHATE visualization of this data shows fibroblasts progressing to a point of divergence where two lineages emerge, one that successfully undergoes reprogramming and another that undergoes apoptosis (cell death). Our model correctly identifies the initial branching point as having negative values of Gaussian curvature indicating saddle-like divergent paths out of the branching point (Fig. 5). Moreover, the principal directions on the diverging branch correctly identify the directions in which the cell lineages diverge.

IV Conclusion

We introduced Adaptive Local PCA (AdaL-PCA), a novel method for estimating intrinsic curvature on data manifolds, with a focus on principal curvatures and directions. By dynamically adjusting neighborhood scales based on the explained variance ratio, AdaL-PCA provides robust and accurate curvature estimates without requiring manual parameter tuning. This adaptability effectively handles variations in data density and the lack of prior curvature information, making it ideal for complex, diverse datasets. We validated AdaL-PCA on synthetic surfaces, demonstrating its ability to recover Gaussian and mean curvatures even in noisy settings. Additionally, we applied it to human embryonic single-cell RNA sequencing data, revealing key directions of cellular differentiation and providing biologically meaningful insights. These results highlight AdaL-PCA’s potential in both geometric data analysis and practical applications like single-cell studies. Future work may extend the method to higher dimensions for scalar curvature estimation and improve efficiency by integrating neural network-based local distribution estimation or exploring alternative local PCA frameworks.

References

  • [1] Y. L. Xin, “Curvature estimates for submanifolds with prescribed gauss image and mean curvature,” Calculus of Variations and Partial Differential Equations, vol. 37, no. 3, pp. 385–405, Mar 2010. [Online]. Available: https://doi.org/10.1007/s00526-009-0268-8
  • [2] T. Surazhsky, E. Magid, O. Soldea, G. Elber, and E. Rivlin, “A comparison of gaussian and mean curvatures estimation methods on triangular meshes,” in 2003 IEEE International Conference on Robotics and Automation (Cat. No.03CH37422), vol. 1, 2003, pp. 1021–1026 vol.1.
  • [3] P. Flynn and A. Jain, “On reliable curvature estimation,” in Proceedings CVPR ’89: IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1989, pp. 110–116.
  • [4] C. Zhao, D. Zhao, and Y. Chen, “Simplified gaussian and mean curvatures to range image segmentation,” in Proceedings of 13th International Conference on Pattern Recognition, vol. 2, 1996, pp. 427–431 vol.2.
  • [5] S. Rusinkiewicz, “Estimating curvatures and their derivatives on triangle meshes,” in Proceedings. 2nd International Symposium on 3D Data Processing, Visualization and Transmission, 2004. 3DPVT 2004.   IEEE, 2004, pp. 486–493.
  • [6] G. Bella, G. Bruno, and A. Santoro, “Ball pivoting algorithm and discrete gaussian curvature: A direct way to curved nanographene circularly polarized luminescence spectral simulation,” FlatChem, vol. 42, p. 100567, 2023. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S2452262723000995
  • [7] J. Hou, Y. Li, Z. Sun, H. Wang, M. Lu, J. Hu, and H. Wu, “A cooperative regulation method for greenhouse soil moisture and light using gaussian curvature and machine learning algorithms,” Computers and Electronics in Agriculture, vol. 215, p. 108452, 2023. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S0168169923008402
  • [8] W. He, Z. Jiang, C. Zhang, and A. M. Sainju, “Curvanet: Geometric deep learning based on directional curvature for 3d shape analysis,” in Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, ser. KDD ’20.   New York, NY, USA: Association for Computing Machinery, 2020, p. 2214–2224. [Online]. Available: https://doi.org/10.1145/3394486.3403272
  • [9] D. Bhaskar, K. MacDonald, O. Fasina, D. S. Thomas, B. Rieck, I. Adelstein, and S. Krishnaswamy, “Diffusion curvature for estimating local curvature in high dimensional data,” in Advances in Neural Information Processing Systems, A. H. Oh, A. Agarwal, D. Belgrave, and K. Cho, Eds., 2022. [Online]. Available: https://openreview.net/forum?id=AYII8AkvD1e
  • [10] A. Hickok and A. J. Blumberg, “An intrinsic approach to scalar-curvature estimation for point clouds,” arXiv preprint arXiv:2308.02615, 2023.
  • [11] D. Sritharan, S. Wang, and S. Hormoz, “Computing the riemannian curvature of image patch and single-cell rna sequencing data manifolds using extrinsic differential geometry,” Proceedings of the National Academy of Sciences, vol. 118, no. 29, p. e2100473118, 2021.
  • [12] M. P. do Carmo, Differential geometry of curves & surfaces, 2nd ed.   Dover Publications, Inc., Mineola, NY, 2016.
  • [13] A. Singer and H.-T. Wu, “Vector diffusion maps and the connection laplacian,” Communications on pure and applied mathematics, vol. 65, no. 8, pp. 1067–1144, 2012.
  • [14] G. Taubin, “Estimating the tensor of curvature of a surface from a polyhedral approximation,” in Proceedings of IEEE International Conference on Computer Vision.   IEEE, 1995, pp. 902–907.
  • [15] S. L. Shishkin, A. Shalaginov, and S. D. Bopardikar, “Fast approximate truncated svd,” Numerical Linear Algebra with Applications, vol. 26, no. 4, p. e2246, 2019.
  • [16] C. Xu, W. Xu, and K. Jing, “Fast algorithms for singular value decomposition and the inverse of nearly low-rank matrices,” National Science Review, vol. 10, no. 6, p. nwad083, 2023.
  • [17] K. Moon, “Embryoid body data for phate,” Mendeley Data, vol. 1, 2018.