By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
World of SoftwareWorld of SoftwareWorld of Software
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Search
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
Reading: HSVM Decision Boundaries: Visualizing PGD vs. SDP and Moment Relaxation | HackerNoon
Share
Sign In
Notification Show More
Font ResizerAa
World of SoftwareWorld of Software
Font ResizerAa
  • Software
  • Mobile
  • Computing
  • Gadget
  • Gaming
  • Videos
Search
  • News
  • Software
  • Mobile
  • Computing
  • Gaming
  • Videos
  • More
    • Gadget
    • Web Stories
    • Trending
    • Press Release
Have an existing account? Sign In
Follow US
  • Privacy
  • Terms
  • Advertise
  • Contact
Copyright © All Rights Reserved. World of Software.
World of Software > Computing > HSVM Decision Boundaries: Visualizing PGD vs. SDP and Moment Relaxation | HackerNoon
Computing

HSVM Decision Boundaries: Visualizing PGD vs. SDP and Moment Relaxation | HackerNoon

News Room
Last updated: 2026/01/17 at 6:44 PM
News Room Published 17 January 2026
Share
HSVM Decision Boundaries: Visualizing PGD vs. SDP and Moment Relaxation | HackerNoon
SHARE

Table of Links

Abstract and 1. Introduction

  1. Related Works

  2. Convex Relaxation Techniques for Hyperbolic SVMs

    3.1 Preliminaries

    3.2 Original Formulation of the HSVM

    3.3 Semidefinite Formulation

    3.4 Moment-Sum-of-Squares Relaxation

  3. Experiments

    4.1 Synthetic Dataset

    4.2 Real Dataset

  4. Discussions, Acknowledgements, and References

A. Proofs

B. Solution Extraction in Relaxed Formulation

C. On Moment Sum-of-Squares Relaxation Hierarchy

D. Platt Scaling [31]

E. Detailed Experimental Results

F. Robust Hyperbolic Support Vector Machine

E Detailed Experimental Results

E.1 Visualizing Decision Boundaries

Here we visualize the decision boundary of for PGD, SDP relaxation and sparse moment-sum-ofsquares relaxation (Moment) on one fold of the training to provide qualitative judgements.

We first visualize training on the first fold for Gaussian 1 dataset from Figure 3 in Figure 5. We mark the train set with circles and test set with triangles, and color the decision boundary obtained by three methods with different colors. In this case, note that SDP and Moment overlap and give identical decision boundary up to machine precision, but they are different from the decision boundary of PGD method. This slight visual difference causes the performance difference displayed in Table 1.

We next visualize the decision boundary for tree 2 from Figure 3 in Figure 6. Here the difference is dramatic: we visualize both the entire data in the left panel and the zoomed-in one on the right. We indeed observe that the decision boundary from moment-sum-of-squares relaxation have roughly equal distance from points to the grey class and to the green class, while SDP relaxation is suboptimal in that regard but still enclosing the entire grey region. PGD, however, converges to a very poor local minimum that has a very small radius enclosing no data and thus would simply classify all data sample to the same class, since all data falls to one side of the decision

boundary. As commented in Section 4, data imbalance is to blame, in which case the final converged solution is very sensitive to the choice of initialization and other hyperparameters such as learning rate. This is in stark contrast with solving problems using the interior point method, where after implementing into MOSEK, we are essentially care-free. From this example, we see that empirically sparse moment-sum-of-squares relaxation finds linear separator of the best quality, particularly in cases where PGD is expected to fail.

Figure 6: Decision boundary visualization of the train test split from the first fold. The left panel shows all the data and the right panel zooms in to the decision boundary. PGD gets stuck in a bad local minima (a tiny circle in the right panel) and thus classify all data samples to one class. While both SDP and moment relaxation give a decision boundary that demarcate one class from another, Moment has roughly equal margin to samples from the grey class and to samples from the green class, which is preferred in large-margin learning.

E.2 Synthetic Gaussian

To generate mixture of Gaussian in hyperbolic space, we first generate them in Euclidean space, with the center coordinates independently drawn from a standard normal distribution. 𝐾 such centers are drawn for defining 𝐾 different classes. Then we sample isotropic Gaussian at respective center with scale 𝑠. Finally, we lift the generated Gaussian mixtures to hyperbolic spaces using exp0 . For simplicity, we only present results for the extreme values: 𝐾 ∈ {2, 5}, 𝑠 ∈ {0.4, 1}, and 𝐶 ∈ {0.1, 10}.

For each method (PGD, SDP, Moment), we compute the train/test accuracy, weighted F1 score, and loss on each of the 5 folds of data for a specific (𝐾, 𝑠, 𝐶) configuration. We then average these metrics across the 5 folds, for all methods and configurations. To illustrate the performance, we plot the improvements of the average metrics of the Moment and SDP methods compared to PGD as bar plots for 15 different seeds. Outliers beyond the interquartile range (Q1 and Q3) are excluded for clarity, and a zero horizontal line is marked for reference. Additionally, to compare the Moment and SDP methods, we compute the average optimality gaps similarly, defined in Equation (15), and present them as bar plots. Our analysis begins by examining the train/test accuracy and weighted F1 score of the PGD, SDP, and Moment methods across various synthetic Gaussian configurations, as shown in Figures 7 to 10.

Figure 7: Train/test accuracy and train/test f1 improvements compared to PGD across various 𝐶 ∈ {0.1, 10} for 𝐾 = 2 and 𝑠 = 0.4

Figure 8: Train/test accuracy and train/test f1 improvements compared to PGD across various 𝐶 ∈ {0.1, 10} and 𝐶 for 𝐾 = 2 and 𝑠 = 1.0

Figure 9: Train/test accuracy and train/test f1 improvements compared to PGD across various 𝐶 ∈ {0.1, 10} for 𝐾 = 5 and 𝑠 = 0.4

Figure 10: Train/test accuracy and train/test f1 improvements compared to PGD across various 𝐶 ∈ {0.1, 10} for 𝐾 = 2 and 𝑠 = 1.0

Across various configurations, we observe that both the Moment and SDP methods generally show improvements over PGD in terms of train and test accuracy as well as weighted F1 score. Notably, we observe that Moment method often shows more consistent improvements compared to SDP. This consistency is evident across different values of (𝐾, 𝑠, 𝐶), suggesting that the Moment method is more robust and provide more generalizable decision boundaries. Moreover, we observe that 1. for larger number of classes (i.e. larger 𝐾), the Moment method consistently and significantly outperforms both SDP and PGD, highlighting its capability to manage complex class structures efficiently; and 2. for simpler datasets (with smaller scale 𝑠), both Moment and SDP methods generally outperform PGD, where the Moment method particularly shows a promising performance advantage over both PGD and SDP.

Figure 11: Train/test loss improvements compared to PGD and optimality gaps comparison across various 𝐶 ∈ {0.1, 10} for 𝐾 = 2 and 𝑠 = 0.4

Figure 12: Train/test loss improvements compared to PGD and optimality gaps comparison across various 𝐶 ∈ {0.1, 10} for 𝐾 = 2 and 𝑠 = 1

Figure 13: Train/test loss improvements compared to PGD and optimality gaps comparison across various 𝐶 ∈ {0.1, 10} for 𝐾 = 5 and 𝑠 = 0.4

Figure 14: Train/test loss improvements compared to PGD and optimality gaps comparison across various 𝐶 ∈ {0.1, 10} for 𝐾 = 5 and 𝑠 = 1

Next, we move to examine the train/test loss improvements compared to PGD and optimality gaps comparison across various configurations, shown in Figures 11 to 14. We observe that for 𝐾 = 5, the Moment method achieves significantly smaller losses compared to both PGD and SDP, which aligns with our previous observations on accuracy and weighted F1 scores. However, for 𝐾 = 2, the losses of the Moment and SDP methods are generally larger than PGD’s. Nevertheless, it is important to note that these losses are not direct measurements of our optimization methods’ quality; rather, they measure the quality of the extracted solutions. Therefore, a larger loss does not necessarily imply that our optimization methods are inferior to PGD, as the heuristic extraction methods might significantly impact the loss. Additionally, we observe that the optimality gaps of the Moment method are significantly smaller than those of the SDP method, suggesting that Moment provides better solutions. Interestingly, the optimality gaps of the Moment method also exhibit smaller variance compared to SDP, as indicated by the smaller boxes in the box plots, further supporting the consistency and robustness of the Moment method.

Lastly, we compare the computational efficiency of these methods, where we compute the average runtime to finish 1 fold of training for each model on synthetic dataset, shown in Table 4. We observe that sparse moment relaxation typically requires at least one order of magnitude in runtime compared to other methods, which to some extent limits the applicability of this method to large scale dataset.

Table 4: Average runtime to finish 1 fold of training for each model on synthetic dataset.

E.3 Real Data

In this section we provide detailed performance breakdown by the choice of regularization 𝐶 for both one-vs-one and one-vs-rest scheme in Tables 5 to 10.

Table 5: Real dataset performance (𝐶 = 0.1), one-vs-rest

Table 6: Real dataset performance (𝐶 = 1.0), one-vs-rest

Table 7: Real dataset performance (𝐶 = 10.0), one-vs-rest

In one-vs-rest scheme, we observe that the Moment method consistently outperforms both PGD and SDP across almost all datasets and 𝐶 in terms of accuracy and F1 scores. Notably, the optimality gaps, 𝜂, for Moment are consistently lower than those for SDP, indicating that the Moment method’s solution obtain a better gap, which underscore the effectiveness of the Moment method in real datasets.

Table 8: Real dataset performance (𝐶 = 0.1), one-vs-one

Table 9: Real dataset performance (𝐶 = 1.0), one-vs-one

Table 10: Real dataset performance (𝐶 = 10.0), one-vs-one

In one-vs-one scheme however, we observe that the SDP and Moment have comparative performances, both better than PGD. Nevertheless, the optimality gaps of SDP are still significantly larger than the Moment’s, for almost all cases.

Similarly, we compare the average runtime to finish 1 fold of training for each model on these real datasets, shown in Table 11. We observe a similar trend: the sparse moment relaxation typically requires at least an order of magnitude more runtime compared to the other methods.

Table 11: Average runtime to finish 1 fold of training for each model on real dataset.

:::info
Authors:

(1) Sheng Yang, John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA ([email protected]);

(2) Peihan Liu, John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA ([email protected]);

(3) Cengiz Pehlevan, John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, Center for Brain Science, Harvard University, Cambridge, MA, and Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Cambridge, MA ([email protected]).

:::


:::info
This paper is available on arxiv under CC by-SA 4.0 Deed (Attribution-Sharealike 4.0 International) license.

:::

Sign Up For Daily Newsletter

Be keep up! Get the latest breaking news delivered straight to your inbox.
By signing up, you agree to our Terms of Use and acknowledge the data practices in our Privacy Policy. You may unsubscribe at any time.
Share This Article
Facebook Twitter Email Print
Share
What do you think?
Love0
Sad0
Happy0
Sleepy0
Angry0
Dead0
Wink0
Previous Article 'Landman' Season 2 Finale: Streaming Release Date and Time 'Landman' Season 2 Finale: Streaming Release Date and Time
Next Article Apple's 1TB M5 MacBook Pro hits ,599 with new B&H Photo deal Apple's 1TB M5 MacBook Pro hits $1,599 with new B&H Photo deal
Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Stay Connected

248.1k Like
69.1k Follow
134k Pin
54.3k Follow

Latest News

Optimizing Software With Zero-Copy And Other Techniques
Optimizing Software With Zero-Copy And Other Techniques
Software
5 Phone Carriers To Avoid At All Costs, According To Consumer Reports – BGR
5 Phone Carriers To Avoid At All Costs, According To Consumer Reports – BGR
News
First vivo and OPPO, now another Android brand will offer an external camera lens
First vivo and OPPO, now another Android brand will offer an external camera lens
News
How to watch Australian Open 2026 for free online from anywhere: TV schedule
How to watch Australian Open 2026 for free online from anywhere: TV schedule
News

You Might also Like

AI Coding Tip 003 – Force Read-Only Planning | HackerNoon
Computing

AI Coding Tip 003 – Force Read-Only Planning | HackerNoon

5 Min Read
Ethereum Targets ,000—But PEPETO Could Deliver 10,000% More Upside | HackerNoon
Computing

Ethereum Targets $7,000—But PEPETO Could Deliver 10,000% More Upside | HackerNoon

0 Min Read
Important AMDGPU & AMDKFD Driver Improvements Readied For Linux 6.20~7.0
Computing

Important AMDGPU & AMDKFD Driver Improvements Readied For Linux 6.20~7.0

2 Min Read
The AI Engine is the New Artist: Rethinking Royalties in an Age of Infinite Content | HackerNoon
Computing

The AI Engine is the New Artist: Rethinking Royalties in an Age of Infinite Content | HackerNoon

5 Min Read
//

World of Software is your one-stop website for the latest tech news and updates, follow us now to get the news that matters to you.

Quick Link

  • Privacy Policy
  • Terms of use
  • Advertise
  • Contact

Topics

  • Computing
  • Software
  • Press Release
  • Trending

Sign Up for Our Newsletter

Subscribe to our newsletter to get our newest articles instantly!

World of SoftwareWorld of Software
Follow US
Copyright © All Rights Reserved. World of Software.
Welcome Back!

Sign in to your account

Lost your password?