∎
e-mail: [email protected], [email protected]
ParticleNet and its application on CEPC Jet Flavor Tagging
Abstract
Quarks, except top quark, and gluon would hadronize and fragment into a spray of stable particles, called jet. Identification of quark flavor is essential for collider experiments in high-energy physics, relying on flavor tagging algorithms. In this study, using a full simulation of the Circular Electron Positron Collider (CEPC), we investigated the flavor tagging performance of two different algorithms: ParticleNet, based on Graph Neural Network, and LCFIPlus, based on the Gradient Booted Decision Tree. Compared to LCFIPlus, ParticleNet significantly enhances flavor tagging performance, resulting in a significant improvement in benchmark measurement accuracy, i.e., a 36% improvement for measurement and a 75% improvement for measurement via W boson decay, respectively, when CEPC operates as a Higgs factory at the center-of-mass energy of 240 GeV and integrated luminosity of 5.6 . We compared the performance of ParticleNet and LCFIPlus at different vertex detector configurations, observing that the inner radius is the most sensitive parameter, followed by material budget and spatial resolution.
Keywords:
CEPC Jet Flavor Tagging ParticleNet1 Introduction
A jet refers to a spray of stable particles formed through the hadronization of an energetic quark or gluon. The W/Z/Higgs boson and the top quark, the four most massive Standard Model (SM) particles, decay mainly into quarks and hadronize to jets. Figure 1 illustrates a reconstructed event with center-of-mass energy of 91.2 GeV. Efficient identification of the jet flavor could shed light on the properties of those massive particles and is critical for experimental exploration at the high-energy frontier. Flavor tagging is used to distinguish jets hadronized from different quark flavors or gluon. To promote the development of future electron-positron Higgs factories, which is regarded as the highest priority for the next collider ES , accurate performance analysis and optimization of both detectors and algorithms are essential. Jet flavor tagging and relevant benchmark analyses serve as good objectives.

Operation mode | Z factory | WW | Higgs factory | |
(GeV) | 91.2 | 160 | 240 | 360 |
Run time (year) | 2 | 1 | 10 | 5 |
Instantaneous luminosity | 191.7 | 26.6 | 8.3 | 0.83 |
(, per IP) | ||||
Integrated luminosity | 100 | 6 | 20 | 1 |
(, 2 IPs) | ||||
Event yields | 3 1012 | 1 108 | 4 106 | 5 105 |
The Circular Electron Positron Collider (CEPC) CEPCStudyGroup:2018ghi is a large-scale collider facility proposed after the discovery of the Higgs boson in 2012. It is designed with a circumference of 100 km with two interaction points. It can operate at multiple center-of-mass energies, including 240 GeV as a Higgs factory, 160 GeV for the threshold scan, and 91 GeV as a Z factory. It also can be upgraded to 360 GeV for the threshold scan. Table 1 summarizes its baseline operating scheme and the corresponding boson yields Sun:2023lut . In the future, it can be upgraded to a proton-proton collider to directly explore new physics at a center-of-mass energy of about 100 TeV. The main scientific objective of the CEPC is the precise measurement of the Higgs properties, especially its coupling properties. Additionally, trillions of events can provide an excellent opportunity for studying flavor physics.
Jet flavor tagging performance depends on detector design, particularly the design of the vertex detector, as well as the utilization of reconstruction algorithms. In this study, we apply ParticleNet Qu:2019gqs to the CEPC and assess its flavor tagging performance in the measurement of and via W decay. Our results demonstrate that ParticleNet outperforms the baseline jet flavor tagging algorithm, LCFIPlus LCFIPlus , by achieving a 36% and 75% improvement in the relative statistical accuracy of and measurement via W boson decay at the center-of-mass energy of 240 GeV and integrated luminosity of 5.6 . We also observe that both ParticleNet and LCFIPlus perform better in the barrel region when compared to the endcap region. By analyzing the dependence of flavor tagging performance on vertex detector configurations, we observe that the most sensitive vertex detector parameter is the inner radius, followed by the material budget and spatial resolution. This result is consistent with previous studies conducted using LCFIPlus.
This article is organized as follows. Section 2 introduces the CEPC detector, software, and the samples used in this analysis. Section 3 describes the jet flavor tagging algorithms (LCFIPlus and ParticleNet) and the method used to evaluate their performance. Section 4 quantifies the dependence of flavor tagging performance on vertex detector configuration and compares the performance of ParticleNet and LCFIPlus. Finally, Section 5 provides a brief conclusion.
2 CEPC Detector, software, and samples

At present, two interaction points of CEPC are designed with the same baseline detector, which is designed according to the Particle Flow Algorithm (PFA) principle and emphasizes reconstructing visible final state particles in the most-suited detector subsystems. The structure of the CECP detector is shown in Fig. 2. From inner to outer, the baseline detector is composed of a silicon pixel vertex detector, a silicon inner tracker, a Time Projection Chamber (TPC) surrounded by a silicon external tracker, a silicon-tungsten sampling Electromagnetic Calorimeter (ECAL), a steel-glass Resistive Plate Chambers sampling Hadronic Calorimeter (HCAL), a 3 Tesla superconducting solenoid, and a flux return yoke embedded with a muon detector. For flavor tagging, the vertex detector is critical. At the CEPC, the vertex detector is designed with six concentric cylindrical layers of square silicon pixel sensors. The mechanical structure of the vertex detector consists of ladders, with each ladder supporting sensors on both sides. The detailed structure of the vertex detector is depicted in Fig. 3, and its specific parameters are listed in Table 2.

radius | spatial resolution | material budget | |
---|---|---|---|
(mm) | () | ||
Layer 1 | 16 | 2.8 | 0.15%/X0 |
Layer 2 | 18 | 6 | 0.15%/X0 |
Layer 3 | 37 | 4 | 0.15%/X0 |
Layer 4 | 39 | 4 | 0.15%/X0 |
Layer 5 | 58 | 4 | 0.15%/X0 |
Layer 6 | 60 | 4 | 0.15%/X0 |

A baseline reconstruction software chain has been developed to quantify the scientific merit and guide the detector optimization of CEPC, see Fig. 4. The data flow of the CEPC baseline software starts from the event generators of Whizard whizard and Pythia pythia . The detector geometry is implemented into MokkaPlus mokka , a GEANT4-based full simulation module. MokkaPlus calculates the energy deposition in the detector-sensitive volumes and creates simulated hits. For each sub-detector, the digitization module converts the simulated hits into digitized hits by convolution of the corresponding sub-detector responses. The reconstruction modules include the tracking, the Particle Flow, and the high-level reconstruction algorithms. The digitized tracker hits are reconstructed into tracks via the tracking algorithms track . The Particle Flow algorithm, Arbor arbor , reads the reconstructed tracks and the calorimeter hits to build reconstructed particles. High-level reconstruction algorithms reconstruct composite physics objects such as converted photons, jets, taus, and so on, and identify the flavor of the jets.
In this paper, we utilized hadronic events at Z-pole operation, including 1 million events, 1 million events, and 0.33 million each of events. For ParticleNet, we divided the samples into three distinct sets: the training set for training the model, the validation set used to validate whether the model is overfitting or underfitting, and the testing set used to give flavor tagging results. The ratios of samples in these sets were set at 60%, 20%, and 20%, respectively. For LCFIPlus, we use all samples to do the test since we have already trained the model.
3 Flavor tagging algorithms and their performance
In this section, we introduce LCFIPlus and ParticleNet and compare their performance based on the CEPC detector and software. Both algorithms read the information of reconstructed jet candidates and calculate the jet likeness to b, c, and light categories.
The LCFIPlus package, a framework for jet analysis in linear collider studies, was originally developed by the International Linear Collider (ILC) ILC , and has since been widely used at the Compact Linear Collider (CLIC) Robson:2018enq , the Future Circular Collider (FCC-ee) FCCCDR , and CEPC. The LCFIPlus package consists of vertex finding, jet clustering, vertex refinement, and flavor tagging. To perform flavor tagging, the jets are classified into four categories based on the number of reconstructed vertices and isolated leptons in the jet. A set of variables is then extracted for each category, which includes the number of tracks in each vertex, the vertex mass, the distance between the secondary vertex and the primary vertex, the vertex decay length, the track transverse momentum, and more. Further details can be found in LCFIPlus . In each category, two types of flavor tagging algorithms are trained using the Gradient Boosted Decision Tree (GBDT) method, one for the b-tagging algorithm and the other for the c-tagging algorithm.


Variable | Definition |
---|---|
difference in pseudorapidity between the particle and the jet axis | |
difference in azimuthal angle between the particle and the jet axis | |
logarithm of the particle’s | |
logarithm of the particle’s energy | |
logarithm of the particle’s relative to the jet | |
logarithm of the particle’s energy relative to the jet energy | |
angular separation between the particle and the jet axis () | |
transverse impact parameter of the track | |
err | uncertainty associated with the measurement of the |
longitudinal impact parameter of the track | |
err | uncertainty associated with the measurement of the |
charge | electric charge of the particle |
isElectron | whether the particle is an electron |
isMuon | whether the particle is a muon |
isChargedKaon | whether the particle is a charged Kaon |
isChargedPion | whether the particle is a charged Pion |
isProton | whether the particle is a proton |
isNeutralHadron | whether the particle is a neutral hadron |
isPhoton | whether the particle is a photon |
The ParticleNet based on Graph Neural Network (GNN) zhou2020graph was published at the beginning of 2019. The architecture of ParticleNet is shown in the left plot of Fig. 5. It consists of three EdgeConv blocks, one channel-wise global average pooling block, and two fully connected blocks followed by a softmax function to output the b/c/light-likeness for each jet. The core concept of ParticleNet is the EdgeConv operation, which is realized by applying feature aggregation for each particle and its k nearest particles in the jet. The specific process of each EdgeConv block is illustrated in the right plot of Fig. 5. It starts by finding the -nearest neighbors for each particle within the jet. The edge between each particle and its -nearest neighbors is determined using the input features of each particle. In the first EdgeConv block, the spatial coordinates () of the particles in the pseudorapidity-azimuth space are used to compute the edge of each pair of particles, while the subsequent EdgeConv blocks use the learned feature vectors as coordinates. The input features for our task, listed in Table 3, include the kinematic variables constructed with the 4-momentum of each particle, the PID information, the charge, and impact parameters. The distance between the interaction point and the path of a track is defined as the impact parameter, where the distance along the beam is called and perpendicular to the beam is called .





Both flavor tagging algorithms assign three values to each jet: b-likeness, c-likeness, and light-likeness, with the constraint that their sum equals unity. The scatter plots in Fig. 6 show the distribution of b-likeness versus c-likeness for samples of /light quarks with ParticleNet. In these plots, b-jets tend to concentrate in the region of larger b-likeness, c-jets in the region of larger c-likeness, and light-jets in the region of smaller b/c-likeness. The phase space spanned by the b/c-likeness is divided into three different regions corresponding to identified b, c, and light quarks. We then obtain the ratios of b-jets identified as b-jets, b-jets identified as c-jets, and so on. These ratios can be represented with a migration matrix, as shown in Fig. 7. The working point (phase space separation) can be optimized according to the specific analysis requirements. For general cases, we adopt the method using two orthogonal lines passing through the point (0.5, 0.5), as depicted by the two red lines in figure 6.
4 Performance analyses
The performance of ParticleNet and LCFIPlus is evaluated by the following three criteria. The first one is the migration matrix since the perfect flavor tagging performance corresponds to the identity matrix. The second one is the physics performance, the better flavor tagging algorithm would induce better physics results. The last one is the vertex detector optimization since it is relevant to the resolution of transverse momentum and impact parameters, and further quantifies the reconstructed performance of the detector.
4.1 Performance comparison and impact on benchmarks of and
Figure 7 displays the migration matrices obtained using LCFIPlus and ParticleNet, respectively. Comparing the performance of LCFIPlus, ParticleNet achieves a significant improvement in b/c-tagging efficiency, with an enhancement of 15% for b jets and 32% for c jets. The trace of the matrix abbreviated as is 3.0 for perfect jet flavor tagging performance and it increases from 2.30 to 2.64 with the utilization of ParticleNet. Both LCFIPlus and ParticleNet face a more challenging task in c-tagging, as its properties lie between those of b and light jets.


In the top plot of Fig. 8, we present the correlation between jet flavor tagging performance, described by , and jet polar angle, which is defined as the angle with respect to the beam line and represented by the angle . Both LCFIPlus and ParticleNet exhibit better performance in the barrel region compared to the endcap region, due to the relatively lower resolution of transverse momentum () and impact parameters ( and ) in the endcap region. The value of ParticleNet performance divided by LCFIPlus performance can be used to describe the performance improvement of ParticleNet relative to LCFIPlus. The bottom plot of Fig. 8 shows the correlation between those values and the jet polar angle. Compared to LCFIPlus, ParticleNet can improve the trace of the migration matrix by more than 10% in the barrel region and more than 30% in the endcap regions.


The performance of both flavor tagging algorithms is compared in benchmark analyses. The first analysis we look into is the signal strength measurement of . In the paper HiggsBCG , the authors demonstrate a correlation between the trace of the migration matrix and the accuracy of the signal strength of when CEPC operates as a Higgs factory at the center-of-mass energy of 240 GeV and integrated luminosity of 5.6 , as depicted in the top plot of Fig. 9. Using LCFIPlus, the trace is 2.30, corresponding to an accuracy of 0.057, indicated by the green star. ParticleNet enhances the trace to 2.64, aligning with an accuracy of 0.042, represented by the orange star. The second analysis is the signal strength measurement of , the magnitude of , which is one of the Cabibbo-Kobayashi-Maskawa (CKM) matrix elements and governs the transition between charm and bottom quarks. Accurate measurement of plays a pivotal role in the study of weak interactions within the SM. When CEPC operates as a Higgs factory at the center-of-mass energy of 240 GeV and integrated luminosity of 5.6 , the authors demonstrate that ParticleNet can significantly improve the accuracy of signal strength by 75% in the measurement of through Vcb , as depicted in the bottom plot of Fig. 9.
4.2 Comparison on vertex detector optimization
Jet flavor tagging performance depends on the detector design, especially the vertex detector. The vertex detector is mainly characterized by three parameters: material budget, spatial resolution, and inner radius. The CEPC vertex detector is designed with three concentric cylinders of double-sided layers, and the parameters are listed in Table 2.
In a previous study VTX using the LCFIPlus flavor tagging algorithm, the correlation between c-jet tagging efficiency multiplied by purity () and relevant vertex detector parameters was quantified. The measurement of Higgs at the CEPC revealed a correlation between and the c-jet tagging . By combining these correlations, the relationship between and relevant vertex detector parameters can be obtained, shown as the top plot of Fig. 10. This correlation is formulated in expression 1, where // represent the default design of CEPC vertex detector and // represent the new design. The coefficients fitted from the correlations indicate the importance of the corresponding detector parameter on the flavor tagging performance. The results obtained from LCFIPlus demonstrate that the flavor tagging performance is more sensitive to the inner radius, followed by the material budget, and lastly the spatial resolution.
(1) |
(2) |


The same analysis was conducted using ParticleNet. The Z-pole samples are fully simulated based on different vertex detector configurations and fed to ParticleNet to train. The results are illustrated in the bottom plot of Fig. 10 and equation 2. Compared to LCFIPlus, ParticleNet exhibits a larger value (2.64 v.s. 2.30), and its coefficients are roughly 50% of those of LCFIPlus. In other words, the ParticleNet has a lower dependence on the geometric parameters. However, both methods have the same order of impact for three different geometric parameters: both identify the inner radius as the most sensitive to flavor tagging performance and spatial resolution as the least sensitive.
conservative | baseline | optimal | ||
---|---|---|---|---|
LCFIPlus | 0.071 | 0.057 | 0.047 | |
ParticleNet | 0.045 | 0.042 | 0.038 | |
1.58 | 1.36 | 1.26 | ||
LCFIPlus | 0.0241 | 0.0133 | 0.0091 | |
ParticleNet | 0.0086 | 0.0076 | 0.0067 | |
2.80 | 1.75 | 1.36 |
The influence of geometric modifications on benchmark analyses can be assessed by referring to Fig. 10 in subsection 4.1. Consider two scenarios: one optimal and the other conservative, where the values of three vertex detector parameters is 0.5/2 times compared to the baseline design. This adjustment leads to changes in from 2.64 to 2.75/2.53 for ParticleNet and from 2.30 to 2.10/2.50 for LCFIPlus, as indicated by the vertical lines in Fig. 9. The accuracy of and measurement under different scenarios using ParticleNet and LCFIPlus is presented in Table 4. Compared to LCFIPlus, ParticleNet significantly improves the accuracy of benchmark measurements. In the baseline scenario, the improvement is 36% and 75% for and measurement, respectively. While at the conservative scenario, the improvement can be enhanced to 58% for and nearly 3 times for .
5 Conclusion
Flavor tagging, a methodology employed to discern the origins of jets, holds immense significance in the realm of experimental exploration at the High Energy Frontier. Jets originating from different quarks or gluons have key differences, represented in the multiplicity of different species of particles, the secondary vertices, the opening angle of jets, etc. The flavor tagging performance depends on both the flavor tagging algorithm and detector design. To pursue excellent discovery power and innovative design of the detector, intensive research and design towards the key detector technologies, especially towards the vertex detectors are performed. Meanwhile, the development of innovative algorithms injects new momentum into this field.
In this paper, we analyze the performance of ParticleNet and LCFIPlus. The ParticleNet based on GNN has been intensively used at CMS PRLQu ; PN1 ; PN2 and FCC-ee FCCPN . The LCFIPlus is a GBDT-based algorithm that has served as the baseline flavor tagging algorithm for CEPC and multiple future electron-positron Higgs factories. Using fully simulated hadronic events at a center-of-mass energy of 91.2 GeV at the CEPC baseline detector, we quantify the performance of both algorithms. We use a 3-dimensional migration matrix to describe the flavor tagging performance (representing the identification efficiency and misidentification rate), and the trace of the migration matrix is used as the key parameter to characterize flavor tagging.
At the CEPC baseline detector geometry, we observe that ParticleNet is significantly superior to LCFIPlus. At the inclusive hadronic Z pole sample, the trace of ParticleNet is larger than LCFIPlus by more than 14%. Consequently, the relative statistical accuracy of and measurement via W boson decay is improved by 36% and 75%, respectively, when CEPC operates as a Higgs factory at the center-of-mass energy of 240 GeV and integrated luminosity of 5.6 . Another paper Liao:2022ufk shows that ParticleNet can improve the statistical uncertainty of measurement by 60% at the CEPC. The flavor tagging performance, which is described by , of both ParticleNet and LCFIPlus depends on the polar angle. Both algorithms exhibit better performance in the barrel and smoothly degrade in the forward region. We also apply ParticleNet to different vertex detector geometries and observe that the flavor tagging performance is most sensitive to the inner radius, followed by the material budget and the spatial resolution. The result is consistent with that conducted by LCFIPlus. Benchmark performance in two scenarios, conservative and optimal, where the values of three vertex detector parameters are 2 and 0.5 times compared to the baseline design, reveals that ParticleNet can significantly enhance physics performance in the conservative scenario while showing less significant improvement with the aggressive detector design.
Acknowledgements.
We thank the computing center of the Institute of High Energy Physics for providing the computing resources. Thanks to Gang Li, Congqiao Li, and Shudong Wang for providing guidance on software. This project is supported by the Fundamental Research Funds for the Central Universities, Peking University. This project is also supported by the National Natural Science Foundation of China under grant No. 12042507.References
- (1) European Strategy Group, Deliberation document on the 2020 Update of the European Strategy for Particle Physics (Brochure), CERN-ESU-016
- (2) J. B. Guimarães da Costa et al. [CEPC Study Group], CEPC Conceptual Design Report: Volume 2 - Physics & Detector, arXiv:1811.10545 [hep-ex]
- (3) K. Sun, Y. Wang, J. Liu, Y. Zhu, M. Ruan and Y. Wang, A conceptual design of TOF based on MRPC technology for the future electron-positron Higgs factory, arXiv:2306.11512 [hep-ex]
- (4) H. Qu and L. Gouskos, ParticleNet: Jet Tagging via Particle Clouds, Phys. Rev. D 101 (2020) no.5, 056019
- (5) T. Suehara, T. Tanabe, LCFIPlus, A Framework for jet Analysis in Linear Collider Studies, arXiv: 1506.08371 [physics.ins-det]
- (6) Z. Wu, G. Li, D. Yu, C. Fu, Q. Ouyang, M. Ruan, Study of vertex optimization at the CEPC, Journal of Instrumentation. 13. T09002-T09002, doi:10.1088/1748-0221/13/09/T09002
- (7) Y. Zhu and M. Ruan, Performance study of the full hadronic WW and ZZ events´ separation at the CEPC, https://doi.org/10.1140/epjc/s10052-019-6719-2
- (8) W. Kilian, T. Ohl, J. Reuter, WHIZARD: Simulating Multi-Particle Processes at LHC and ILC, Eur.Phys.J.C 71 (2011) 1742,
- (9) Sjöstrand, Torbjörn, Stephen Mrenna and Peter Skands, PYTHIA 6.4 Physics and Manual, Journal of High Energy Physics 2006.05 (2006): 026–026. Crossref. Web.
- (10) P. Moras de Freitas et al., MOKKA: A detailed Geant4 simulation for the international linear collider detectors, https://flcwiki.desy.de/Mokka
- (11) F. Gaede, S. Aplin, R. Glattauer, C. Rosemann, and G. Voutsinas, Track reconstruction at the ILC: the ILD tracking software, J. Phys. Conf. Ser., Vol. 513, P. 022011, 2014
- (12) M. Ruan, Arbor, a new approach of the Particle Flow Algorithm, arXiv: 1403.4784 [physics.ins-det]
- (13) H. Baer, T. Barklow, K. Fujii, Y. Gao, The International Linear Collider Technical Design Report - Volume 2: Physics, [arXiv:1306.6352 [hep-ph]], pg.33
- (14) A. Robson, P. N. Burrows, N. Catalan Lasheras, L. Linssen, M. Petric, D. Schulte, E. Sicking, S. Stapnes and W. Wuensch, The Compact Linear e+e- Collider (CLIC): Accelerator and Detector, arXiv:1812.07987 [physics.acc-ph]
- (15) A. Abbrescia, M. AbdusSalam et al., FCC-ee: The Lepton Collider, Eur. Phys. J. Spec. Top. 228, 261-623 (2019)
- (16) Y. Coadou, Boosted decision trees, doi:10.1142/9789811234033_0002, arXiv:2206.09645 [physics.data-an]
- (17) J. Zhou, G. Cui, S. Hu, Z. Zhang, C. Yang, Z. Liu, L. Wang, C. Li and M. Sun, Graph neural networks: A review of methods and applications, arXiv:1812.08434 [cs.LG]
- (18) Y. Zhu, H. Cui and M. Ruan, The Higgsbb, cc, gg measurement at CEPC, J. High Energ. Phys. 2022, 100 (2022)
- (19) H. Liang, Y. Zhu, Y. Wang and M. Ruan, Measurements of and Higgs Rare and Flavor-Changing Neutral Current Hadronic Decay at CEPC, https://indico.ihep.ac.cn/event/19839/contributions/138763
- (20) CMS Collaboration, Search for Higgs Boson Decay to a Charm Quark-Antiquark Pair in Proton-Proton Collisions at , https://link.aps.org/doi/10.1103/PhysRevLett.131.061801
- (21) CMS Collaboration, Mass regression of highly-boosted jets using graph neural networks, https://cds.cern.ch/record/2777006
- (22) CMS Collaboration, Calibration of the mass-decorrelated ParticleNet tagger for boosted and jets using LHC Run 2 data, https://cds.cern.ch/record/2805611
- (23) B. Franco, G. Loukas and S. Michele, Jet flavour tagging for future colliders with fast simulation, doi:10.1140/epjc/s10052-022-10609-1
- (24) L. Liao, S. Wang, W. Song, Z. Zhang and G. Li, Performance studies of jet flavor tagging and measurement of using ParticleNet at CEPC, arXiv:2208.13503 [hep-ex]