-
Following the discovery of the 125 GeV Standard Model (SM)-like Higgs boson at the Large Hadron Collider (LHC) [1, 2], the possibility of additional scalar particles has attracted considerable attention. Although the SM contains only one fundamental scalar, there is no symmetry preventing the existence of more. Such additional scalars are theoretically motivated as they may play essential roles in addressing baryon asymmetry, electroweak phase transition, and dark matter [3−5].
Intriguingly, multiple experiments have reported possible local excesses near 95 GeV in different channels, including
$ b\bar{b} $ at LEP [6],$ \gamma\gamma $ at CMS and ATLAS [7, 8], and$ \tau^+\tau^- $ at CMS [9]. These excesses can be characterized by the following signal strength modifiers [10]:$ \begin{aligned}[b] \mu_{bb}^{\exp} =\;& \frac{\sigma_{\mathrm{BSM}}(e^+e^-\to ZS(\to b\bar{b}))}{\sigma_{\mathrm{SM}}(e^+e^-\to Zh_{95}(\to b\bar{b}))} = 0.117 \pm 0.057\,, \\ \mu_{\tau\tau}^{\exp} =\;& \frac{\sigma_{\mathrm{BSM}}(gg\to S\to \tau^+\tau^-)}{\sigma_{\mathrm{SM}}(gg\to h_{95}\to\tau^+\tau^-)} = 1.2 \pm 0.5\,, \\ \mu_{\gamma\gamma}^{\exp} =\;& \frac{\sigma_{\mathrm{BSM}}(gg\to S\to \gamma\gamma)}{\sigma_{\mathrm{SM}}(gg\to h_{95}\to\gamma\gamma)} = 0.24^{+0.09}_{-0.08}\,, \end{aligned}$
(1) where S denotes a hypothetical new scalar, and
$ h_{95} $ is the SM Higgs boson with the mass rescaled to 95 GeV. It is worth noting that another CMS report on the$ S\to\tau\tau $ channel in association with top quark pairs [11] presents results that differ from the di-tau excess discussed here. As that analysis remains preliminary, without subsequent updates, we focus our discussion on the currently available and more finalized results [9].These hints have stimulated a broad range of phenomenological studies on simple extensions of the SM [12−23], multi-Higgs model [10, 24−49], supersymmetric models [50−63], and Georgi-Machacek models [64−68]. Ref. [69] presents the light Higgs (lighter than 120 GeV) discovery potential at the International Linear Collider (ILC) in the
$ S\to b\bar{b} $ decay channel and a center-of-mass energy of$ \sqrt{s} = 250\;\text{GeV} $ . Ref. [70] provides the exclusion limit on the Z-light Higgs coupling at the ILC in the Higgs-strahlung process with decay channel independence. Through the study of the$ S\to\tau\tau $ decay channel, Ref. [71] states that, in the search for a light Higgs, the search sensitivity for the$ S\to\tau\tau $ decay channel is higher than that for the decay channel-independent one. Ref. [39] features the potential of the Circular Electron-Positron Collider (CEPC) to probe this hypothetical scalar through the$ S\to b\bar{b} $ channel, considering a center-of-mass energy of$ \sqrt{s} = 250\;\text{GeV} $ .Among the observed anomalies, the possible excess in the
$ \tau^+\tau^- $ channel is even more pronounced, and in certain beyond the SM (BSM) scenarios, such as the two Higgs doublet model (2HDM) [72], the decay of$ S \to \tau^+\tau^- $ can be enhanced significantly. This motivates a dedicated investigation into the$ \tau^+\tau^- $ final state. The CEPC, primarily designed for precision measurements of the 125 GeV SM-like Higgs boson, is expected to operate at$ \sqrt{s} = 240 $ GeV with an integrated luminosity of approximately 20 ab$ ^{-1} $ [73−76]. However, as a 95 GeV scalar is significantly lighter than the design target of the CEPC, the optimal center-of-mass energy ($ \sqrt{s} $ ) for its discovery remains to be systematically assessed. Motivated by these considerations, the first part of this study is dedicated to identifying the optimal collision energy for probing a light Higgs with a mass near 95 GeV. In addition, we investigate the discovery potential of a light Higgs boson at 95 GeV in the$ S\to\tau\tau $ decay channel by employing machine learning (ML) techniques such as eXtreme Gradient Boosting (XGBoost), Gradient Boosting Decision Tree (GBDT), and deep neural networks (DNNs) to enhance the sensitivity. Furthermore, a detailed analysis is performed to evaluate the expected precision of the$ Z-S $ coupling and the branching ratio of$ S\to\tau\tau $ at the CEPC. Finally, as a concrete example, we apply our approach within the Next-to 2HDM (N2HDM) to assess its capability in testing specific new physics scenarios. -
After the discovery of the 125 GeV Standard Model (SM)-like Higgs boson at the Large Hadron Collider (LHC) [1, 2], the possibility of additional scalar particles has attracted considerable attention. While the SM contains only one fundamental scalar, there is no symmetry preventing the existence of more. Such extra scalars are theoretically motivated as they may play essential roles in addressing the baryon asymmetry, electroweak phase transition, and dark matter [3−5].
Intriguingly, multiple experiments have reported possible local excesses near 95 GeV in different channels, including
$ b\bar{b} $ at LEP [6],$ \gamma\gamma $ at CMS and ATLAS [7, 8], and$ \tau^+\tau^- $ at CMS [9]. These excesses can be characterized by the signal strength modifiers [10]:$ \begin{aligned}[b] \mu_{bb}^{\exp} =\;& \frac{\sigma_{\mathrm{BSM}}(e^+e^-\to ZS(\to b\bar{b}))}{\sigma_{\mathrm{SM}}(e^+e^-\to Zh_{95}(\to b\bar{b}))} = 0.117 \pm 0.057\,, \end{aligned}$
$ \begin{aligned}[b] \mu_{\tau\tau}^{\exp} =\;& \frac{\sigma_{\mathrm{BSM}}(gg\to S\to \tau^+\tau^-)}{\sigma_{\mathrm{SM}}(gg\to h_{95}\to\tau^+\tau^-)} = 1.2 \pm 0.5\,, \\ \mu_{\gamma\gamma}^{\exp} =\;& \frac{\sigma_{\mathrm{BSM}}(gg\to S\to \gamma\gamma)}{\sigma_{\mathrm{SM}}(gg\to h_{95}\to\gamma\gamma)} = 0.24^{+0.09}_{-0.08}\,, \end{aligned}$
(1) where S denotes a hypothetical new scalar, and
$ h_{95} $ is the SM Higgs boson with mass rescaled to 95 GeV. It is worth noting that another CMS report on the$ S\to\tau\tau $ channel in association with top quark pairs [11] presents results that differ from the di-tau excess discussed here. As that analysis remains preliminary without subsequent updates, we have focused our discussion on the currently available and more finalized results [9].These hints have stimulated a broad range of phenomenological studies in the simple extensions of the SM [12−23], the multi-Higgs model [10, 24−49], the supersymmetric models [50−63], and the Georgi-Machacek models [64−68]. Ref. [69] studies the light Higgs (lighter than 120 GeV) discovery potential at the International Linear Collider (ILC) in the
$ S\to b\bar{b} $ decay channel and a center-of-mass energy of$ \sqrt{s} = 250\,\text{GeV} $ . Ref. [70] gives the exclusion limit on the Z-light Higgs coupling at the ILC in the Higgs-strahlung process with decay channel-independent. Through the study of the$ S\to\tau\tau $ decay channel, Ref. [71] finds that in the search for light Higgs, the search sensitivity for the$ S\to\tau\tau $ decay channel is higher than that for the decay channel-independent one. Ref. [39] explored the potential of the Circular Electron-Positron Collider (CEPC) to probe this hypothetical scalar through the$ S\to b\bar{b} $ channel, considering a center-of-mass energy of$ \sqrt{s} = 250\,\text{GeV} $ .Among the observed anomalies, the possible excess in the
$ \tau^+\tau^- $ channel is even more pronounced, and in certain beyond the Standard Model (BSM) scenarios, such as the two Higgs doublet model (2HDM) [72], the decay of$ S \to \tau^+\tau^- $ can be significantly enhanced. This motivates a dedicated investigation of the$ \tau^+\tau^- $ final state. The CEPC, primarily designed for precision measurements of the 125 GeV SM-like Higgs boson, is expected to operate at$ \sqrt{s} = 240 $ GeV with an integrated luminosity of about 20 ab$ ^{-1} $ [73−76]. However, since a 95 GeV scalar is significantly lighter than the design target of the CEPC, the optimal center-of-mass energy ($ \sqrt{s} $ ) for its discovery remains to be systematically assessed. Motivated by these considerations, the first part of this work is dedicated to identifying the optimal collision energy for probing a light Higgs with mass near 95 GeV. In addition, we investigate the discovery potential of a light Higgs boson at 95 GeV in the$ S\to\tau\tau $ decay channel, employing machine learning techniques such as eXtreme Gradient Boosting (XGBoost), Gradient Boosting Decision Tree (GBDT), and Deep Neural Network (DNN) to enhance the sensitivity. Furthermore, a detailed analysis is performed to evaluate the expected precision of the$ Z-S $ coupling and the branching ratio of$ S\to\tau\tau $ at the CEPC. Finally, as a concrete example, we apply our approach within the Next-to Two Higgs Doublet Model (N2HDM) to assess its capability in testing specific new physics scenarios. -
Following the discovery of the 125 GeV Standard Model (SM)-like Higgs boson at the Large Hadron Collider (LHC) [1, 2], the possibility of additional scalar particles has attracted considerable attention. Although the SM contains only one fundamental scalar, there is no symmetry preventing the existence of more. Such additional scalars are theoretically motivated as they may play essential roles in addressing baryon asymmetry, electroweak phase transition, and dark matter [3−5].
Intriguingly, multiple experiments have reported possible local excesses near 95 GeV in different channels, including
$ b\bar{b} $ at LEP [6],$ \gamma\gamma $ at CMS and ATLAS [7, 8], and$ \tau^+\tau^- $ at CMS [9]. These excesses can be characterized by the following signal strength modifiers [10]:$ \begin{aligned}[b] \mu_{bb}^{\exp} =\;& \frac{\sigma_{\mathrm{BSM}}(e^+e^-\to ZS(\to b\bar{b}))}{\sigma_{\mathrm{SM}}(e^+e^-\to Zh_{95}(\to b\bar{b}))} = 0.117 \pm 0.057\,, \\ \mu_{\tau\tau}^{\exp} =\;& \frac{\sigma_{\mathrm{BSM}}(gg\to S\to \tau^+\tau^-)}{\sigma_{\mathrm{SM}}(gg\to h_{95}\to\tau^+\tau^-)} = 1.2 \pm 0.5\,, \\ \mu_{\gamma\gamma}^{\exp} =\;& \frac{\sigma_{\mathrm{BSM}}(gg\to S\to \gamma\gamma)}{\sigma_{\mathrm{SM}}(gg\to h_{95}\to\gamma\gamma)} = 0.24^{+0.09}_{-0.08}\,, \end{aligned}$
(1) where S denotes a hypothetical new scalar, and
$ h_{95} $ is the SM Higgs boson with the mass rescaled to 95 GeV. It is worth noting that another CMS report on the$ S\to\tau\tau $ channel in association with top quark pairs [11] presents results that differ from the di-tau excess discussed here. As that analysis remains preliminary, without subsequent updates, we focus our discussion on the currently available and more finalized results [9].These hints have stimulated a broad range of phenomenological studies on simple extensions of the SM [12−23], multi-Higgs model [10, 24−49], supersymmetric models [50−63], and Georgi-Machacek models [64−68]. Ref. [69] presents the light Higgs (lighter than 120 GeV) discovery potential at the International Linear Collider (ILC) in the
$ S\to b\bar{b} $ decay channel and a center-of-mass energy of$ \sqrt{s} = 250\;\text{GeV} $ . Ref. [70] provides the exclusion limit on the Z-light Higgs coupling at the ILC in the Higgs-strahlung process with decay channel independence. Through the study of the$ S\to\tau\tau $ decay channel, Ref. [71] states that, in the search for a light Higgs, the search sensitivity for the$ S\to\tau\tau $ decay channel is higher than that for the decay channel-independent one. Ref. [39] features the potential of the Circular Electron-Positron Collider (CEPC) to probe this hypothetical scalar through the$ S\to b\bar{b} $ channel, considering a center-of-mass energy of$ \sqrt{s} = 250\;\text{GeV} $ .Among the observed anomalies, the possible excess in the
$ \tau^+\tau^- $ channel is even more pronounced, and in certain beyond the SM (BSM) scenarios, such as the two Higgs doublet model (2HDM) [72], the decay of$ S \to \tau^+\tau^- $ can be enhanced significantly. This motivates a dedicated investigation into the$ \tau^+\tau^- $ final state. The CEPC, primarily designed for precision measurements of the 125 GeV SM-like Higgs boson, is expected to operate at$ \sqrt{s} = 240 $ GeV with an integrated luminosity of approximately 20 ab$ ^{-1} $ [73−76]. However, as a 95 GeV scalar is significantly lighter than the design target of the CEPC, the optimal center-of-mass energy ($ \sqrt{s} $ ) for its discovery remains to be systematically assessed. Motivated by these considerations, the first part of this study is dedicated to identifying the optimal collision energy for probing a light Higgs with a mass near 95 GeV. In addition, we investigate the discovery potential of a light Higgs boson at 95 GeV in the$ S\to\tau\tau $ decay channel by employing machine learning (ML) techniques such as eXtreme Gradient Boosting (XGBoost), Gradient Boosting Decision Tree (GBDT), and deep neural networks (DNNs) to enhance the sensitivity. Furthermore, a detailed analysis is performed to evaluate the expected precision of the$ Z-S $ coupling and the branching ratio of$ S\to\tau\tau $ at the CEPC. Finally, as a concrete example, we apply our approach within the Next-to 2HDM (N2HDM) to assess its capability in testing specific new physics scenarios. -
We investigate the process
$ e^+e^- \to Z(\to\mu^+\mu^-) S(\to\tau^+\tau^-) $ at the CEPC. The dominant irreducible background is$ e^+e^- \to Z(\to\mu^+\mu^-)\tau^+\tau^- $ , hereafter denoted as$ Z\tau\tau $ , which contains the same final‐state topology as the signal. A sizable reducible background arises from$ e^+e^- \to Z(\to\mu^+\mu^-)XX $ ($ X = j,\,e,\,\mu $ ), collectively labeled$ Zjj $ , where one or two X objects can be mis-tagged as hadronic τ jets. Signal and background events are simulated using$ {\textsf{MadGraph5}}\_{\textsf{aMC@NLO}}\_{\textsf{v3.4.2}}$ [77, 78], interfaced to$ {\textsf{PYTHIA 8.2}}$ [79] for parton showering and hadronization and$ {\textsf{Delphes 3.5.0}}$ [80] for detector simulation using the CEPC baseline card, with τ-jets identified at an efficiency of 80% [81].Owing to the difficulty of reconstructing τ decays, we employ the recoil mass observable [82], which allows the invariant mass of the scalar S to be inferred without relying on the reconstruction of its decay products. The recoil mass is defined as
$ M_{\mathrm{recoil}} \equiv \sqrt{s + M_{\mu^+\mu^-}^2 - 2\sqrt{s}(E_{\mu^+} + E_{\mu^-})}, $
(2) and it depends only on the well-measured four-momentum of the muon pair. This observable is robust against τ-decay ambiguities and provides strong discriminating power between the signal and background.
Figure 1 shows the
$ M_{\mathrm{recoil}} $ distributions for the signal$ ZS $ with$ M_S = 95.5 $ GeV, backgrounds$ Z\tau\tau $ and$ Zjj $ , total (signal + background), and ratio of the total to the background. The number of signal events is normalized to 0.2, reflecting the assumption that the signal cross section is 20% of the SM prediction, whereas the background event numbers are normalized to 1. The signal exhibits a pronounced peak around$ M_S $ , whereas both backgrounds show broader distributions peaking near 91 GeV, consistent with the Z mass. The total distribution is background-dominated across the mass range, as further reflected in the lower panel, where the ratio of the total to the backgrounds exhibits a localized peak near the signal mass.
Figure 1. (color online) Recoil mass distributions for the signal process
$ ZS $ with$ M_S = 95.5 $ (normalized to 0.2), backgrounds$ Z\tau\tau $ and$ Zjj $ (normalized to 1), and total (signal + background). The lower panel shows the ratio of the total to the background.We scan the center-of-mass energy
$ \sqrt{s} $ from 190 to 240 GeV in 2 GeV steps and vary the scalar mass$ M_S $ from 94 to 100 GeV in 0.5 GeV increments. For illustration, we consider the signal cross section to be 20% of the corresponding SM process with$ m_h=M_S $ , as computed using$ {\textsf{MadGraph5}}\_{\textsf{aMC@NLO}}\_{\textsf{v3.4.2}}$ [77, 78].Event selection requires two identified muons to reconstruct
$ M_{\mathrm{recoil}} $ and at least one τ-tagged jet to suppress the$ Zjj $ background. A second τ-tag is not required, as its efficiency is limited and would significantly reduce the signal acceptance despite better background rejection [81]. Basic kinematic requirements are imposed on the muons:$ \begin{array}{r} p_{\mathrm{T}}({\mu}) \gt 10 \; {\rm{GeV}},\quad p_{T}({\tau}) \gt 20 \; {\rm{GeV}}, \\ |\eta(\mu)| \lt 2.5,\quad \Delta R(\mu_1,\mu_2) \gt 0.4, \end{array} $
(3) where
$ p_\mathrm{T} $ and η denote the transverse momentum and pseudorapidity, respectively, and$ \Delta R = \sqrt{(\Delta\eta)^2 + (\Delta\phi)^2} $ measures the angular separation of the muon pair. In addition, the recoil mass is required to lie within$ |M_{\mathrm{recoil}} - M_S| \lt 1.5 $ GeV to suppress the backgrounds further.The basic selection suppresses the reducible
$ Zjj $ background by roughly four orders of magnitude. Requiring$ |M_{\mathrm{recoil}}-M_S|<1.5 $ GeV removes approximately 90% of the remaining$ Zjj $ and$ Z\tau\tau $ events while retaining approximately 80% of the signal.Figure 2 shows the signal significance
$ \mathcal{Z} $ in the$ (\sqrt{s}, M_S) $ plane, evaluated for an integrated luminosity of$ L = 500 \ {\rm{fb}} ^{-1} $ . The significance is computed using the Poisson expression [83]
Figure 2. (color online) Signal significance
$ \mathcal{Z} $ in the$ M_S $ versus$ \sqrt{s} $ plane, assuming an integrated luminosity of 500$ {\rm{fb}}^{-1} $ . The red star indicates the benchmark point at$ M_S = 95.5 $ GeV and$ \sqrt{s} = 210 $ GeV.$ \mathcal{Z}= \sqrt{2L\left[(\mathcal{S}+\mathcal{B})\ln(1+\mathcal{S}/\mathcal{B})-\mathcal{S}\right]}, $
(4) where
$ \mathcal{S} $ and$ \mathcal{B} $ denote the signal and background cross sections, respectively. Approximately one million signal and background events are generated for each parameter point. After applying the cut to$ M_{\mathrm{recoil}} $ , the dominant SM background originates from$ Z\tau\tau $ , with approximately 40,000 events surviving. The uncertainty regarding the expected number of background events is approximately 1%, which translates to a variation of approximately 0.4% in the estimated signal significance. At low statistics, this uncertainty is much smaller than the statistical uncertainty. Furthermore, we test a more conservative scenario by varying the expected number of background events by ±10%. The corresponding change in the signal significance is less than 4.5%. We observe a moderate increase in$ \mathcal{Z} $ with$ M_S $ , reflecting the more rapid decline in SM backgrounds at higher scalar masses. The optimal center-of-mass energy is found to be$ \sqrt{s} \simeq 210 $ GeV. At the benchmark point$ (M_S = 95.5 $ GeV and$ \sqrt{s} = 210 $ GeV), we obtain$ \mathcal{Z} \approx 5.1 \sigma $ . Reaching the same significance at the CEPC design energy of 240 GeV would require a luminosity of approximately 720${\rm{fb}} ^{-1} $ , approximately 1.4 times that required at 210 GeV. -
We investigate the process
$ e^+e^- \to Z(\to\mu^+\mu^-) S(\to\tau^+\tau^-) $ at the CEPC. The dominant irreducible background is$ e^+e^- \to Z(\to\mu^+\mu^-)\tau^+\tau^- $ , hereafter denoted$ Z\tau\tau $ , which contains the same final‐state topology as the signal. A sizable reducible background arises from$ e^+e^- \to Z(\to\mu^+\mu^-)XX $ ($ X = j,\,e,\,\mu $ ), collectively labelled$ Zjj $ , where one or two X objects can be mis-tagged as hadronic τ jets. Signal and background events are simulated with$ {\textsf{MadGraph5}}\_{\textsf{aMC@NLO}}\_{\textsf{v3.4.2}}$ [77, 78], interfaced to$ {\textsf{PYTHIA 8.2}}$ [79] for parton showering and hadronization, and$ {\textsf{Delphes 3.5.0}}$ [80] for detector simulation using the CEPC baseline card with τ-jets identified at an efficiency of 80% [81].Due to the difficulty of reconstructing τ decays, we employ the recoil mass observable [82], which allows the invariant mass of the scalar S to be inferred without relying on the reconstruction of its decay products. The recoil mass is defined as
$ M_{\mathrm{recoil}} \equiv \sqrt{s + M_{\mu^+\mu^-}^2 - 2\sqrt{s}(E_{\mu^+} + E_{\mu^-})}, $
(2) and depends only on the well-measured four-momentum of the muon pair. This observable is robust against τ-decay ambiguities and provides strong discriminating power between the signal and background.
Fig. 1 shows the
$ M_{\mathrm{recoil}} $ distributions for the signal$ ZS $ with$ M_S = 95.5 $ GeV, the backgrounds$ Z\tau\tau $ and$ Zjj $ , the total (signal + background), and the ratio of the total to the background. The number of signal events is normalized to 0.2, reflecting the assumption that the signal cross section is 20% of the Standard Model prediction, while the background event numbers are normalized to 1. The signal exhibits a pronounced peak around$ M_S $ , while both backgrounds show broader distributions peaking near 91 GeV, consistent with the Z mass. The total distribution is background-dominated across the mass range, as further reflected in the lower panel, where the ratio of the total to the backgrounds exhibits a localized peak near the signal mass.
Figure 1. (color online) Recoil mass distributions for the signal process
$ ZS $ with$ M_S = 95.5 $ (normalized to 0.2), backgrounds$ Z\tau\tau $ and$ Zjj $ (normalized to 1), and the total (signal + background). The lower panel shows the ratio of the total to the background.We scan the center-of-mass energy
$ \sqrt{s} $ from 190 to 240 GeV in 2 GeV steps and vary the scalar mass$ M_S $ from 94 to 100 GeV in 0.5 GeV increments. For illustration, we take the signal cross section to be 20% of the corresponding SM process with$ m_h=M_S $ , as computed using$ {\textsf{MadGraph5}}\_{\textsf{aMC@NLO}}\_{\textsf{v3.4.2}}$ [77, 78].Event selection requires two identified muons to reconstruct
$ M_{\mathrm{recoil}} $ and at least one τ-tagged jet to suppress the$ Zjj $ background. A second τ-tag is not required, as its efficiency is limited and would significantly reduce signal acceptance despite better background rejection [81]. Basic kinematic requirements are imposed on the muons:$ \begin{array}{r} p_{\mathrm{T}}({\mu}) > 10 \; {\rm{GeV}},\quad p_{T}({\tau}) > 20 \; {\rm{GeV}}, \\ |\eta(\mu)| < 2.5,\quad \Delta R(\mu_1,\mu_2) > 0.4, \end{array} $
(3) where
$ p_\mathrm{T} $ and η denote the transverse momentum and pseudorapidity, respectively, and$ \Delta R = \sqrt{(\Delta\eta)^2 + (\Delta\phi)^2} $ measures the angular separation between the muon pair. Additionally, we require the recoil mass to lie within$ |M_{\mathrm{recoil}} - M_S| < 1.5 $ GeV to further suppress the backgrounds.The basic selection suppresses the reducible
$ Zjj $ background by roughly four orders of magnitude. Requiring$ |M_{\mathrm{recoil}}-M_S|<1.5 $ GeV removes about 90 % of the remaining$ Zjj $ and$ Z\tau\tau $ events while retaining ~ 80 % of the signal.Fig. 2 shows the signal significance
$ \mathcal{Z} $ in the$ (\sqrt{s}, M_S) $ plane, evaluated for an integrated luminosity of$ L = 500 \ {\rm{fb}} ^{-1} $ . The significance is computed using the Poisson expression [83],
Figure 2. (color online) Signal significance
$ \mathcal{Z} $ in the$ M_S $ versus$ \sqrt{s} $ plane, assuming an integrated luminosity of 500$ {\rm{fb}}^{-1} $ . The red star indicates the benchmark point at$ M_S = 95.5 $ GeV and$ \sqrt{s} = 210 $ GeV.$ \mathcal{Z}= \sqrt{2L\left[(\mathcal{S}+\mathcal{B})\ln(1+\mathcal{S}/\mathcal{B})-\mathcal{S}\right]}, $
(4) where
$ \mathcal{S} $ and$ \mathcal{B} $ denote the signal and background cross sections, respectively. About one million signal and background events are generated for each parameter point. After applying the cut on$ M_{\mathrm{recoil}} $ , the dominant SM background originates from$ Z\tau\tau $ , with approximately 40,000 events surviving. The uncertainty on the expected number of background events is approximately 1%, which translates to a variation of about 0.4% in the estimated signal significance. At low statistics, this uncertainty is much smaller than the statistical uncertainty. Furthermore, we have also tested a more conservative scenario by varying the expected number of background events by ±10%. The corresponding change in the signal significance is less than 4.5%. We observe a moderate increase in$ \mathcal{Z} $ with$ M_S $ , reflecting the more rapid decline of SM backgrounds at higher scalar masses. The optimal center-of-mass energy is found to be$ \sqrt{s} \simeq 210 $ GeV. At the benchmark point$ (M_S = 95.5 $ GeV,$ \sqrt{s} = 210 $ GeV), we obtain$ \mathcal{Z} \approx 5.1 \sigma $ . Reaching the same significance at the CEPC design energy of 240 GeV would require approximately 720${\rm{fb}} ^{-1} $ , about 1.4 times the luminosity needed at 210 GeV. -
We investigate the process
$ e^+e^- \to Z(\to\mu^+\mu^-) S(\to\tau^+\tau^-) $ at the CEPC. The dominant irreducible background is$ e^+e^- \to Z(\to\mu^+\mu^-)\tau^+\tau^- $ , hereafter denoted as$ Z\tau\tau $ , which contains the same final‐state topology as the signal. A sizable reducible background arises from$ e^+e^- \to Z(\to\mu^+\mu^-)XX $ ($ X = j,\,e,\,\mu $ ), collectively labeled$ Zjj $ , where one or two X objects can be mis-tagged as hadronic τ jets. Signal and background events are simulated using$ {\textsf{MadGraph5}}\_{\textsf{aMC@NLO}}\_{\textsf{v3.4.2}}$ [77, 78], interfaced to$ {\textsf{PYTHIA 8.2}}$ [79] for parton showering and hadronization and$ {\textsf{Delphes 3.5.0}}$ [80] for detector simulation using the CEPC baseline card, with τ-jets identified at an efficiency of 80% [81].Owing to the difficulty of reconstructing τ decays, we employ the recoil mass observable [82], which allows the invariant mass of the scalar S to be inferred without relying on the reconstruction of its decay products. The recoil mass is defined as
$ M_{\mathrm{recoil}} \equiv \sqrt{s + M_{\mu^+\mu^-}^2 - 2\sqrt{s}(E_{\mu^+} + E_{\mu^-})}, $
(2) and it depends only on the well-measured four-momentum of the muon pair. This observable is robust against τ-decay ambiguities and provides strong discriminating power between the signal and background.
Figure 1 shows the
$ M_{\mathrm{recoil}} $ distributions for the signal$ ZS $ with$ M_S = 95.5 $ GeV, backgrounds$ Z\tau\tau $ and$ Zjj $ , total (signal + background), and ratio of the total to the background. The number of signal events is normalized to 0.2, reflecting the assumption that the signal cross section is 20% of the SM prediction, whereas the background event numbers are normalized to 1. The signal exhibits a pronounced peak around$ M_S $ , whereas both backgrounds show broader distributions peaking near 91 GeV, consistent with the Z mass. The total distribution is background-dominated across the mass range, as further reflected in the lower panel, where the ratio of the total to the backgrounds exhibits a localized peak near the signal mass.
Figure 1. (color online) Recoil mass distributions for the signal process
$ ZS $ with$ M_S = 95.5 $ (normalized to 0.2), backgrounds$ Z\tau\tau $ and$ Zjj $ (normalized to 1), and total (signal + background). The lower panel shows the ratio of the total to the background.We scan the center-of-mass energy
$ \sqrt{s} $ from 190 to 240 GeV in 2 GeV steps and vary the scalar mass$ M_S $ from 94 to 100 GeV in 0.5 GeV increments. For illustration, we consider the signal cross section to be 20% of the corresponding SM process with$ m_h=M_S $ , as computed using$ {\textsf{MadGraph5}}\_{\textsf{aMC@NLO}}\_{\textsf{v3.4.2}}$ [77, 78].Event selection requires two identified muons to reconstruct
$ M_{\mathrm{recoil}} $ and at least one τ-tagged jet to suppress the$ Zjj $ background. A second τ-tag is not required, as its efficiency is limited and would significantly reduce the signal acceptance despite better background rejection [81]. Basic kinematic requirements are imposed on the muons:$ \begin{array}{r} p_{\mathrm{T}}({\mu}) \gt 10 \; {\rm{GeV}},\quad p_{T}({\tau}) \gt 20 \; {\rm{GeV}}, \\ |\eta(\mu)| \lt 2.5,\quad \Delta R(\mu_1,\mu_2) \gt 0.4, \end{array} $
(3) where
$ p_\mathrm{T} $ and η denote the transverse momentum and pseudorapidity, respectively, and$ \Delta R = \sqrt{(\Delta\eta)^2 + (\Delta\phi)^2} $ measures the angular separation of the muon pair. In addition, the recoil mass is required to lie within$ |M_{\mathrm{recoil}} - M_S| \lt 1.5 $ GeV to suppress the backgrounds further.The basic selection suppresses the reducible
$ Zjj $ background by roughly four orders of magnitude. Requiring$ |M_{\mathrm{recoil}}-M_S|<1.5 $ GeV removes approximately 90% of the remaining$ Zjj $ and$ Z\tau\tau $ events while retaining approximately 80% of the signal.Figure 2 shows the signal significance
$ \mathcal{Z} $ in the$ (\sqrt{s}, M_S) $ plane, evaluated for an integrated luminosity of$ L = 500 \ {\rm{fb}} ^{-1} $ . The significance is computed using the Poisson expression [83]
Figure 2. (color online) Signal significance
$ \mathcal{Z} $ in the$ M_S $ versus$ \sqrt{s} $ plane, assuming an integrated luminosity of 500$ {\rm{fb}}^{-1} $ . The red star indicates the benchmark point at$ M_S = 95.5 $ GeV and$ \sqrt{s} = 210 $ GeV.$ \mathcal{Z}= \sqrt{2L\left[(\mathcal{S}+\mathcal{B})\ln(1+\mathcal{S}/\mathcal{B})-\mathcal{S}\right]}, $
(4) where
$ \mathcal{S} $ and$ \mathcal{B} $ denote the signal and background cross sections, respectively. Approximately one million signal and background events are generated for each parameter point. After applying the cut to$ M_{\mathrm{recoil}} $ , the dominant SM background originates from$ Z\tau\tau $ , with approximately 40,000 events surviving. The uncertainty regarding the expected number of background events is approximately 1%, which translates to a variation of approximately 0.4% in the estimated signal significance. At low statistics, this uncertainty is much smaller than the statistical uncertainty. Furthermore, we test a more conservative scenario by varying the expected number of background events by ±10%. The corresponding change in the signal significance is less than 4.5%. We observe a moderate increase in$ \mathcal{Z} $ with$ M_S $ , reflecting the more rapid decline in SM backgrounds at higher scalar masses. The optimal center-of-mass energy is found to be$ \sqrt{s} \simeq 210 $ GeV. At the benchmark point$ (M_S = 95.5 $ GeV and$ \sqrt{s} = 210 $ GeV), we obtain$ \mathcal{Z} \approx 5.1 \sigma $ . Reaching the same significance at the CEPC design energy of 240 GeV would require a luminosity of approximately 720${\rm{fb}} ^{-1} $ , approximately 1.4 times that required at 210 GeV. -
Despite the baseline cuts, the signal remains obscured by irreducible backgrounds, particularly from the
$ Z\tau\tau $ process. To enhance the sensitivity, we apply ML techniques at the benchmark points ($ M_S = 95.5 $ GeV,$ \sqrt{s} = 210 $ and 240 GeV), comparing three classifiers: XGBoost, GBDT, and DNN.A set of 22 kinematic features is used as input, including:
● Four-momenta of
$ \mu_1 $ ,$ \mu_2 $ , and$ \tau_1 $ :$ E_{\mu_1,\mu_2,\tau_1} $ ,$ p_x^{\mu_1,\mu_2,\tau_1} $ ,$ p_y^{\mu_1,\mu_2,\tau_1} $ , and$ p_z^{\mu_1,\mu_2,\tau_1} $ .● Transverse momenta and pseudorapidities of
$ \mu_1 $ ,$ \mu_2 $ , and$ \tau_1 $ :$ p_\mathrm{T}^{\mu_1,\mu_2,\tau_1} $ and$ \eta_{\mu_1,\mu_2,\tau_1} $ .● Angular separations
$ \Delta R $ among$ \mu_1 $ ,$ \mu_2 $ , and$ \tau_1 $ :$ \Delta R[\mu_1, \mu_2] $ ,$ \Delta R[\mu_1, \tau_1] $ , and$ \Delta R[\mu_2, \tau_1] $ ● The recoil mass
$ M_{\mathrm{recoil}} $ .The DNN architecture consists of six hidden layers with 64, 48, 32, 24, 16, and 8 neurons, respectively. To mitigate overfitting and vanishing gradients, dropout with a rate of 20% is applied to the first two layers, and each hidden layer uses the Rectified Linear Unit (ReLU) activation function [84]. The output layer contains a single neuron with sigmoid activation. Training is performed using the Adam optimizer with a learning rate of 0.0001. Approximately 200k events are used for training and evaluation, with 70% for training and 30% for testing. The training and testing are performed on an RTX 3060 GPU, and the DNN typically converges within minutes. The model reaches a test accuracy of approximately 77% after 10 epochs. Finally, it reaches a classification accuracy of 78% (77%) and an area under the ROC curve (AUC) of 0.87 (0.85) at
$ \sqrt{s}=210\ (240) $ GeV. Inreasing the size of the training set further mainly improves the stability of the training process while offering limited gains in classification accuracy. Significant additional improvements would likely require more advanced ML architectures or direct access to particle-level information, which we leave for future work.Figure 3 shows the integrated luminosity required to achieve a
$ 5\sigma $ significance at the benchmark point as a function of the signal efficiency before and after applying the ML classifiers. Here, the signal efficiency is defined as the true positive rate (TPR):
Figure 3. (color online) Integrated luminosity required to reach
$ 5\sigma $ significance at the benchmark point as a function of the signal efficiency. The solid (dotted) curves correspond to$ \sqrt{s} = $ $ 210\ (240) $ GeV. The colors denote different selection strategies: baseline (before applying ML classifiers; green), DNN (red), GBDT (yellow), and XGBoost (blue).$ \mathrm{TPR} = \frac{N_{\mathrm{TP}}}{N_{\mathrm{TP}}+N_{\mathrm{FN}}} , $
(5) where
$ N_{\mathrm{TP}} $ is the number of correctly classified signal events, and$ N_{\mathrm{FN}} $ is the number of signal events that are misclassified as background. This quantity characterizes the efficiency of the ML classifier in recognizing signal events. The green, red, yellow, and blue curves (dotted for$ \sqrt{s} = 240 $ GeV) correspond to the baseline selection (before applying the ML classifiers), DNN, GBDT, and XGBoost, respectively, at$ \sqrt{s} = 210 $ GeV. At each energy, the DNN consistently requires the lowest luminosity for any given signal efficiency. The minimum required luminosity is attained at a signal efficiency of approximately 0.66 (0.74) for$ \sqrt{s} = 210\ (240) $ GeV with the DNN, reducing the required luminosity from$ 480\ (690)\;\mathrm{fb}^{-1} $ in the baseline to less than$ 210\ (310)\;\mathrm{fb}^{-1} $ . The full cut flow with$ L = 500\;\mathrm{fb}^{-1} $ is summarized in Table 1.Cuts $ ZS $ $ Z\tau\tau $ $ Zjj $ $ \mathcal{Z} [\sigma] $ Initial 134 (109) 1819 (1769) 24187 (21745) 0.83 (0.71) Basic 59.7 (52.7) 780.2 (816.7) 49.4 (4.7) 2.0 (1.8) $ M_{\mathrm{recoil}} $ 47.4 (38.7) 66.0 (66.0) 5.2 (4.7) 5.1 (4.3) ML 31.5 (28.6) 8.2 (11.5) 0.6 (0.8) 7.7 (6.4) Table 1. Cut flow of signal and background events (in units of events per 500 fb
$ ^{-1} $ ) for$ \sqrt{s} = 210 $ (240) GeV. The numbers in parentheses refer to the 240 GeV case. The final column lists the resulting signal significance$ \mathcal{Z} $ .Figure 4 shows the CEPC coverage (left panel) and detection precision (right panel) in the
$ \mathrm{Br}(S\to\tau^+\tau^-) $ versus$ C_{SZZ} $ planes for$ \sqrt{s} = 210 $ GeV (solid line) and$ 240 $ GeV (dashed line), with an integrated luminosity of$ L = 20\; \mathrm{ab}^{-1} $ . Here,$ C_{SZZ} $ denotes the reduced coupling of$ SZZ $ in the N2HDM-Flipped to its SM value. The left panel presents the expected$ 2\sigma $ and$ 5\sigma $ significance contours. The right panel displays the relative statistical precision on the signal yield, evaluated as
Figure 4. (color online) CEPC sensitivity in the
$ \mathrm{Br}(S\to\tau^+\tau^-) $ versus$ C_{SZZ} $ planes. The left panel shows the expected$ 2\sigma $ and$ 5\sigma $ coverage, and the right panel shows the relative statistical precision of the signal yield. Results are presented for$ \sqrt{s}=210 $ GeV (solid) and$ \sqrt{s}=240 $ GeV (dashed), assuming an integrated luminosity of$ L=20\; \mathrm{ab}^{-1} $ .$ \delta_{\rm stat}=\frac{\sqrt{(\mathcal S+\mathcal B)L}}{\mathcal S L}, $
(6) where
$ \mathcal S $ and$ \mathcal B $ denote the signal and background cross sections, respectively. The primary sources of systematic uncertainty include τ-jet reconstruction, jet energy scale and resolution, and theoretical uncertainties in the cross-section and shape predictions. Based on the CEPC Technical Design Report and related studies [39, 85, 86], we estimate the overall systematic uncertainty to be approximately 2%. Therefore, the statistical uncertainty dominates until it decreases below this level. As the production rate in the Higgs-strahlung channel$ e^+e^-\to ZS(\to\tau^+\tau^-) $ is controlled by both the$ ZS $ coupling and branching fraction of$ S\to\tau^+\tau^- $ , it is convenient to introduce the signal strength$ \mu_{\tau\tau}^{ZS} \equiv C_{SZZ}^2 \cdot \frac{\mathrm{Br}(S\to\tau^+\tau^-)}{\mathrm{Br}_{\text{SM}}(h_{95} \to \tau^+\tau^-)}. $
(7) This single parameter encapsulates the combined dependence on
$ C_{SZZ} $ and$ \mathrm{Br}(S\to\tau^+\tau^-) $ and is used to present the CEPC sensitivity. In this parameterization, CEPC can investigate$ \mu_{\tau\tau}^{ZS} \gt 0.016 $ ($ 0.020 $ ) at$ 5\sigma $ and$ \gt 0.006 $ ($ 0.008 $ ) at$ 2\sigma $ for$ \sqrt{s} = 210 $ ($ 240 $ ) GeV. The right panel shows that percent-level precision of$ \mu_{\tau\tau}^{ZS} $ can be achieved once this quantity reaches$ \mathcal{O}(10^{-2}) $ . The corresponding values of$ \mu_{\tau\tau}^{ZS} $ that are required to obtain the selected precision targets are summarized in Table 2. Furthermore, Ref. [37] indicates that the HL-LHC, with an integrated luminosity of 3 ab$ ^{-1} $ , can reach a$ 5\sigma $ discovery for the process$ pp\to t\bar{t}S(\to \gamma\gamma) $ , provided that the cross section exceeds approximately 0.3 fb. In addition, a 5 ab$ ^{-1} $ CEPC run is projected to be able to cover the current$ \sim 2\sigma $ excess at the$ 5\sigma $ level through the channel$ e^+e^-\to Z(\mu^+\mu^-)S(\to b\bar{b}) $ [39]. Together with our present study, these projections outline a promising and complementary path toward probing the potential BSM origin of the 95 GeV resonance at future colliders.Target
PrecisionRequired $ \mu_{\tau\tau}^{ZS} $
(210 GeV)Required $ \mu_{\tau\tau}^{ZS} $
(240 GeV)10% 0.04 0.05 5% 0.10 0.12 3% 0.22 0.26 2% 0.44 0.52 Table 2. Signal strength
$ \mu_{\tau\tau}^{ZS} $ required to reach the indicated relative statistical precision of the signal yield at$ \sqrt{s}=210 $ and$ 240 $ GeV with$ L=20\; \mathrm{ab}^{-1} $ . -
Despite the baseline cuts, the signal remains obscured by irreducible backgrounds, particularly from the
$ Z\tau\tau $ process. To further enhance sensitivity, we apply machine learning (ML) techniques at the benchmark points ($ M_S = 95.5 $ GeV,$ \sqrt{s} = 210 $ and 240 GeV), comparing three classifiers: gradient-boosted decision trees (GBDT), eXtreme Gradient Boosting (XGBoost), and deep neural networks (DNN).A set of 22 kinematic features is used as input, including:
● Four-momenta of
$ \mu_1 $ ,$ \mu_2 $ , and$ \tau_1 $ :$ E_{\mu_1,\mu_2,\tau_1} $ ,$ p_x^{\mu_1,\mu_2,\tau_1} $ ,$ p_y^{\mu_1,\mu_2,\tau_1} $ , and$ p_z^{\mu_1,\mu_2,\tau_1} $ .● Transverse momenta and pseudorapidities of
$ \mu_1 $ ,$ \mu_2 $ , and$ \tau_1 $ :$ p_\mathrm{T}^{\mu_1,\mu_2,\tau_1} $ and$ \eta_{\mu_1,\mu_2,\tau_1} $ .● Angular separations
$ \Delta R $ among$ \mu_1 $ ,$ \mu_2 $ , and$ \tau_1 $ :$ \Delta R[\mu_1, \mu_2] $ ,$ \Delta R[\mu_1, \tau_1] $ and$ \Delta R[\mu_2, \tau_1] $ ● The recoil mass
$ M_{\mathrm{recoil}} $ .The DNN architecture consists of six hidden layers with 64, 48, 32, 24, 16, and 8 neurons, respectively. To mitigate overfitting and vanishing gradients, dropout with a rate of 20% is applied to the first two layers, and each hidden layer uses the Rectified Linear Unit (ReLU) activation function [84]. The output layer contains a single neuron with a sigmoid activation. Training is performed using the Adam optimizer with a learning rate of 0.0001. Approximately 200k events are used for training and evaluation, with 70% for training and 30% for testing. The training and testing were performed on an RTX 3060 GPU, and the DNN typically converges within minutes. The model reaches a test accuracy of about 77% after 10 epochs. Finally, it reaches a classification accuracy of 78% (77%) and an area under the ROC curve (AUC) of 0.87 (0.85) at
$ \sqrt{s}=210\ (240) $ GeV. Further increasing the size of the training set mainly improves the stability of the training process, while offering limited gains in classification accuracy. Significant additional improvements would likely require more advanced ML architectures or direct access to particle-level information, which we leave for future work.Fig. 3 shows the integrated luminosity required to achieve a
$ 5\sigma $ significance at the benchmark point, as a function of the signal efficiency, before and after applying ML classifiers. Here, the signal efficiency is defined as the true positive rate (TPR),
Figure 3. (color online) Integrated luminosity required to reach
$ 5\sigma $ significance at the benchmark point, as a function of the signal efficiency. Solid (dotted) curves correspond to$ \sqrt{s} = 210\ (240) $ GeV. Colors denote different selection strategies: baseline (before applying ML classifiers, green), DNN (red), GBDT (yellow), and XGBoost (blue).$ \mathrm{TPR} = \frac{N_{\mathrm{TP}}}{N_{\mathrm{TP}}+N_{\mathrm{FN}}} , $
(5) where
$ N_{\mathrm{TP}} $ is the number of signal events correctly classified and$ N_{\mathrm{FN}} $ is the number of signal events misclassified as background. This quantity characterizes the efficiency of the ML classifier in recognizing signal events. The green, red, yellow, and blue curves (dotted for$ \sqrt{s} = 240 $ GeV) correspond to the baseline selection (before applying ML classifiers), DNN, GBDT, and XGBoost, respectively, at$ \sqrt{s} = 210 $ GeV. At each energy, the DNN consistently requires the lowest luminosity for any given signal efficiency. The minimum required luminosity is attained at a signal efficiency of approximately 0.66 (0.74) for$ \sqrt{s} = 210\ (240) $ GeV with DNN, reducing the required luminosity from$ 480\ (690)\;\mathrm{fb}^{-1} $ in the baseline to less than$ 210\ (310)\;\mathrm{fb}^{-1} $ . The full cut flow with$ L = 500\;\mathrm{fb}^{-1} $ is summarized in Table 1.Cuts $ ZS $ $ Z\tau\tau $ $ Zjj $ $ \mathcal{Z} [\sigma] $ Initial 134 (109) 1819 (1769) 24187 (21745) 0.83 (0.71) Basic 59.7 (52.7) 780.2 (816.7) 49.4 (4.7) 2.0 (1.8) $ M_{\mathrm{recoil}} $ 47.4 (38.7) 66.0 (66.0) 5.2 (4.7) 5.1 (4.3) ML 31.5 (28.6) 8.2 (11.5) 0.6 (0.8) 7.7 (6.4) Table 1. Cut flow of signal and background events (in units of events per 500 fb
$ ^{-1} $ ) for$ \sqrt{s} = 210 $ (240) GeV. Numbers in parentheses refer to the 240 GeV case. The final column shows the resulting signal significance$ \mathcal{Z} $ .Fig. 4 shows the CEPC coverage (left panel) and detection precision (right panel) in the
$ \mathrm{Br}(S\to\tau^+\tau^-) $ versus$ C_{SZZ} $ planes, for$ \sqrt{s} = 210 $ GeV (solid line) and$ 240 $ GeV (dashed line), with an integrated luminosity of$ L = 20\; \mathrm{ab}^{-1} $ . Here$ C_{SZZ} $ denotes the reduced coupling of$ SZZ $ in the N2HDM-Flipped to its SM value. The left panel presents the expected$ 2\sigma $ and$ 5\sigma $ significance contours. While the right panel displays the relative statistical precision on the signal yield, evaluated as
Figure 4. (color online) CEPC sensitivity in the
$ \mathrm{Br}(S\to\tau^+\tau^-) $ versus$ C_{SZZ} $ planes. The left panel shows the expected$ 2\sigma $ and$ 5\sigma $ coverage, and the right panel shows the relative statistical precision on the signal yield. Results are presented for$ \sqrt{s}=210 $ GeV (solid) and$ \sqrt{s}=240 $ GeV (dashed), assuming an integrated luminosity of$ L=20\; \mathrm{ab}^{-1} $ .$ \delta_{\rm stat}=\frac{\sqrt{(\mathcal S+\mathcal B)L}}{\mathcal S L}, $
(6) where
$ \mathcal S $ and$ \mathcal B $ denote the signal and background cross sections. The primary sources of systematic uncertainty include τ-jet reconstruction, jet energy scale and resolution, and theoretical uncertainties in the cross-section and shape predictions. Based on the CEPC Technical Design Report and related studies [39, 85, 86], we estimate the overall systematic uncertainty to be about 2%. Therefore, the statistical uncertainty dominates until it drops below this level. Since the production rate in the Higgsstrahlung channel$ e^+e^-\to ZS(\to\tau^+\tau^-) $ is controlled by both the$ ZS $ coupling and the branching fraction of$ S\to\tau^+\tau^- $ , it is convenient to introduce the signal strength$ \mu_{\tau\tau}^{ZS} \equiv C_{SZZ}^2 \cdot \frac{\mathrm{Br}(S\to\tau^+\tau^-)}{\mathrm{Br}_{\text{SM}}(h_{95} \to \tau^+\tau^-)}. $
(7) This single parameter encapsulates the combined dependence on
$ C_{SZZ} $ and$ \mathrm{Br}(S\to\tau^+\tau^-) $ , and will be used to present the CEPC sensitivity. In this parameterization, CEPC can probe$ \mu_{\tau\tau}^{ZS} > 0.016 $ ($ 0.020 $ ) at$ 5\sigma $ , and$ > 0.006 $ ($ 0.008 $ ) at$ 2\sigma $ for$ \sqrt{s} = 210 $ ($ 240 $ ) GeV. The right panel shows that percent-level precision on$ \mu_{\tau\tau}^{ZS} $ can be achieved once this quantity reaches$ \mathcal{O}(10^{-2}) $ . The corresponding values of$ \mu_{\tau\tau}^{ZS} $ required to obtain selected precision targets are summarized in Table 2. Furthermore, Ref. [37] indicates that the HL-LHC, with an integrated luminosity of 3 ab$ ^{-1} $ , could reach a$ 5\sigma $ discovery for the process$ pp\to t\bar{t}S(\to \gamma\gamma) $ provided the cross section exceeds around 0.3 fb. In addition, a 5 ab$ ^{-1} $ CEPC run is projected to be able to cover the current$ \sim 2\sigma $ excess at the$ 5\sigma $ level through the channel$ e^+e^-\to Z(\mu^+\mu^-)S(\to b\bar{b}) $ [39]. Together with our present study, these projections outline a promising and complementary path toward probing the potential BSM origin of the 95 GeV resonance at future colliders.Target Required $ \mu_{\tau\tau}^{ZS} $ Required $ \mu_{\tau\tau}^{ZS} $ precision (210 GeV) (240 GeV) 10% 0.04 0.05 5% 0.10 0.12 3% 0.22 0.26 2% 0.44 0.52 Table 2. Signal strength
$ \mu_{\tau\tau}^{ZS} $ required to reach the indicated relative statistical precision on the signal yield at$ \sqrt{s}=210 $ and$ 240 $ GeV with$ L=20\; \mathrm{ab}^{-1} $ . -
Despite the baseline cuts, the signal remains obscured by irreducible backgrounds, particularly from the
$ Z\tau\tau $ process. To enhance the sensitivity, we apply ML techniques at the benchmark points ($ M_S = 95.5 $ GeV,$ \sqrt{s} = 210 $ and 240 GeV), comparing three classifiers: XGBoost, GBDT, and DNN.A set of 22 kinematic features is used as input, including:
● Four-momenta of
$ \mu_1 $ ,$ \mu_2 $ , and$ \tau_1 $ :$ E_{\mu_1,\mu_2,\tau_1} $ ,$ p_x^{\mu_1,\mu_2,\tau_1} $ ,$ p_y^{\mu_1,\mu_2,\tau_1} $ , and$ p_z^{\mu_1,\mu_2,\tau_1} $ .● Transverse momenta and pseudorapidities of
$ \mu_1 $ ,$ \mu_2 $ , and$ \tau_1 $ :$ p_\mathrm{T}^{\mu_1,\mu_2,\tau_1} $ and$ \eta_{\mu_1,\mu_2,\tau_1} $ .● Angular separations
$ \Delta R $ among$ \mu_1 $ ,$ \mu_2 $ , and$ \tau_1 $ :$ \Delta R[\mu_1, \mu_2] $ ,$ \Delta R[\mu_1, \tau_1] $ , and$ \Delta R[\mu_2, \tau_1] $ ● The recoil mass
$ M_{\mathrm{recoil}} $ .The DNN architecture consists of six hidden layers with 64, 48, 32, 24, 16, and 8 neurons, respectively. To mitigate overfitting and vanishing gradients, dropout with a rate of 20% is applied to the first two layers, and each hidden layer uses the Rectified Linear Unit (ReLU) activation function [84]. The output layer contains a single neuron with sigmoid activation. Training is performed using the Adam optimizer with a learning rate of 0.0001. Approximately 200k events are used for training and evaluation, with 70% for training and 30% for testing. The training and testing are performed on an RTX 3060 GPU, and the DNN typically converges within minutes. The model reaches a test accuracy of approximately 77% after 10 epochs. Finally, it reaches a classification accuracy of 78% (77%) and an area under the ROC curve (AUC) of 0.87 (0.85) at
$ \sqrt{s}=210\ (240) $ GeV. Inreasing the size of the training set further mainly improves the stability of the training process while offering limited gains in classification accuracy. Significant additional improvements would likely require more advanced ML architectures or direct access to particle-level information, which we leave for future work.Figure 3 shows the integrated luminosity required to achieve a
$ 5\sigma $ significance at the benchmark point as a function of the signal efficiency before and after applying the ML classifiers. Here, the signal efficiency is defined as the true positive rate (TPR):
Figure 3. (color online) Integrated luminosity required to reach
$ 5\sigma $ significance at the benchmark point as a function of the signal efficiency. The solid (dotted) curves correspond to$ \sqrt{s} = $ $ 210\ (240) $ GeV. The colors denote different selection strategies: baseline (before applying ML classifiers; green), DNN (red), GBDT (yellow), and XGBoost (blue).$ \mathrm{TPR} = \frac{N_{\mathrm{TP}}}{N_{\mathrm{TP}}+N_{\mathrm{FN}}} , $
(5) where
$ N_{\mathrm{TP}} $ is the number of correctly classified signal events, and$ N_{\mathrm{FN}} $ is the number of signal events that are misclassified as background. This quantity characterizes the efficiency of the ML classifier in recognizing signal events. The green, red, yellow, and blue curves (dotted for$ \sqrt{s} = 240 $ GeV) correspond to the baseline selection (before applying the ML classifiers), DNN, GBDT, and XGBoost, respectively, at$ \sqrt{s} = 210 $ GeV. At each energy, the DNN consistently requires the lowest luminosity for any given signal efficiency. The minimum required luminosity is attained at a signal efficiency of approximately 0.66 (0.74) for$ \sqrt{s} = 210\ (240) $ GeV with the DNN, reducing the required luminosity from$ 480\ (690)\;\mathrm{fb}^{-1} $ in the baseline to less than$ 210\ (310)\;\mathrm{fb}^{-1} $ . The full cut flow with$ L = 500\;\mathrm{fb}^{-1} $ is summarized in Table 1.Cuts $ ZS $ $ Z\tau\tau $ $ Zjj $ $ \mathcal{Z} [\sigma] $ Initial 134 (109) 1819 (1769) 24187 (21745) 0.83 (0.71) Basic 59.7 (52.7) 780.2 (816.7) 49.4 (4.7) 2.0 (1.8) $ M_{\mathrm{recoil}} $ 47.4 (38.7) 66.0 (66.0) 5.2 (4.7) 5.1 (4.3) ML 31.5 (28.6) 8.2 (11.5) 0.6 (0.8) 7.7 (6.4) Table 1. Cut flow of signal and background events (in units of events per 500 fb
$ ^{-1} $ ) for$ \sqrt{s} = 210 $ (240) GeV. The numbers in parentheses refer to the 240 GeV case. The final column lists the resulting signal significance$ \mathcal{Z} $ .Figure 4 shows the CEPC coverage (left panel) and detection precision (right panel) in the
$ \mathrm{Br}(S\to\tau^+\tau^-) $ versus$ C_{SZZ} $ planes for$ \sqrt{s} = 210 $ GeV (solid line) and$ 240 $ GeV (dashed line), with an integrated luminosity of$ L = 20\; \mathrm{ab}^{-1} $ . Here,$ C_{SZZ} $ denotes the reduced coupling of$ SZZ $ in the N2HDM-Flipped to its SM value. The left panel presents the expected$ 2\sigma $ and$ 5\sigma $ significance contours. The right panel displays the relative statistical precision on the signal yield, evaluated as
Figure 4. (color online) CEPC sensitivity in the
$ \mathrm{Br}(S\to\tau^+\tau^-) $ versus$ C_{SZZ} $ planes. The left panel shows the expected$ 2\sigma $ and$ 5\sigma $ coverage, and the right panel shows the relative statistical precision of the signal yield. Results are presented for$ \sqrt{s}=210 $ GeV (solid) and$ \sqrt{s}=240 $ GeV (dashed), assuming an integrated luminosity of$ L=20\; \mathrm{ab}^{-1} $ .$ \delta_{\rm stat}=\frac{\sqrt{(\mathcal S+\mathcal B)L}}{\mathcal S L}, $
(6) where
$ \mathcal S $ and$ \mathcal B $ denote the signal and background cross sections, respectively. The primary sources of systematic uncertainty include τ-jet reconstruction, jet energy scale and resolution, and theoretical uncertainties in the cross-section and shape predictions. Based on the CEPC Technical Design Report and related studies [39, 85, 86], we estimate the overall systematic uncertainty to be approximately 2%. Therefore, the statistical uncertainty dominates until it decreases below this level. As the production rate in the Higgs-strahlung channel$ e^+e^-\to ZS(\to\tau^+\tau^-) $ is controlled by both the$ ZS $ coupling and branching fraction of$ S\to\tau^+\tau^- $ , it is convenient to introduce the signal strength$ \mu_{\tau\tau}^{ZS} \equiv C_{SZZ}^2 \cdot \frac{\mathrm{Br}(S\to\tau^+\tau^-)}{\mathrm{Br}_{\text{SM}}(h_{95} \to \tau^+\tau^-)}. $
(7) This single parameter encapsulates the combined dependence on
$ C_{SZZ} $ and$ \mathrm{Br}(S\to\tau^+\tau^-) $ and is used to present the CEPC sensitivity. In this parameterization, CEPC can investigate$ \mu_{\tau\tau}^{ZS} \gt 0.016 $ ($ 0.020 $ ) at$ 5\sigma $ and$ \gt 0.006 $ ($ 0.008 $ ) at$ 2\sigma $ for$ \sqrt{s} = 210 $ ($ 240 $ ) GeV. The right panel shows that percent-level precision of$ \mu_{\tau\tau}^{ZS} $ can be achieved once this quantity reaches$ \mathcal{O}(10^{-2}) $ . The corresponding values of$ \mu_{\tau\tau}^{ZS} $ that are required to obtain the selected precision targets are summarized in Table 2. Furthermore, Ref. [37] indicates that the HL-LHC, with an integrated luminosity of 3 ab$ ^{-1} $ , can reach a$ 5\sigma $ discovery for the process$ pp\to t\bar{t}S(\to \gamma\gamma) $ , provided that the cross section exceeds approximately 0.3 fb. In addition, a 5 ab$ ^{-1} $ CEPC run is projected to be able to cover the current$ \sim 2\sigma $ excess at the$ 5\sigma $ level through the channel$ e^+e^-\to Z(\mu^+\mu^-)S(\to b\bar{b}) $ [39]. Together with our present study, these projections outline a promising and complementary path toward probing the potential BSM origin of the 95 GeV resonance at future colliders.Target
PrecisionRequired $ \mu_{\tau\tau}^{ZS} $
(210 GeV)Required $ \mu_{\tau\tau}^{ZS} $
(240 GeV)10% 0.04 0.05 5% 0.10 0.12 3% 0.22 0.26 2% 0.44 0.52 Table 2. Signal strength
$ \mu_{\tau\tau}^{ZS} $ required to reach the indicated relative statistical precision of the signal yield at$ \sqrt{s}=210 $ and$ 240 $ GeV with$ L=20\; \mathrm{ab}^{-1} $ . -
To demonstrate the broader applicability of our ML approach, we examine the sensitivity of the CEPC to the parameter space of the N2HDM-Flipped. The same strategy can also be straightforwardly applied to other models with an extended Higgs sector, such as the next-to-minimal supersymmetric model (NMSSM). The N2HDM-Flipped extends the SM with a second Higgs doublet and a real singlet, where the Yukawa coupling of charged leptons is ''flipped'' from down- (in the Type-II N2HDM or NMSSM) to up-quark-like [87−89]. A complete definition of the N2HDM-Flipped is provided in Section II of Ref. [24], which details its particle content, coupling structure, and differences from scenarios of other types. Ref. [32] shows that the N2HDM-Flipped scenario can accommodate the excesses observed in several channels. The present study further evaluates the CEPC detection potential for the surviving parameter space of this model.
Following electroweak symmetry breaking, three neutral CP-even states
$ H_{a,b,c} $ , a pseudoscalar A, and a charged pair$ H^\pm $ exist in the N2HDM-Flipped. We identify$ H_a $ using the 125 GeV SM like Higgs boson and consider$ H_b $ as the putative 95 GeV resonance. In total, 11 new parameters need to be defined: the masses of$ H_{a,b,c} $ , A, and$ H^\pm $ ; the ratio of the vacuum expected values (VEV) of two Higgs doublets tanβ; the effective couplings of$ H_a $ to massive gauge bosons ($ C_{H_aVV}^2 $ ) and top-quarks ($ C_{H_att}^2 $ ); the mixing matrix elements between$ H_{a,b} $ and the singlet field ($ R_{a3}^{{\mathrm{in}}} $ and$ R_{b3}^{\mathrm{in}} $ ); the soft$ Z_2 $ -breaking parameter$ m_{12}^2 $ ; and the singlet VEV$ v_S $ . The 11 independent parameters are scanned in the following ranges:$ \begin{array}{*{20}{c}} {\begin{array}{*{20}{l}} {\ \ \ 95 \lt m_{H_b} \lt 96\; \text{GeV},}\\ {\ \ 300 \lt m_{H_{c},A} \lt 1500\; \text{GeV}, }\\ {\ \ 0.8 \lt \tan\beta \lt 10, }\\ {0.70 \lt C_{H_aVV}^2 \lt 1.00, }\\ {10^{-3} \lt m_{12}^2 \lt 5\times10^{5}\; \text{GeV}^2, } \end{array}}&{\begin{array}{*{20}{l}} {\ \ \ \ \ \text{sign}(R^{\text{in}}_{a3}) = \pm1, }\\ {\ \ 580 \lt m_{H^\pm} \lt 1500\; \text{GeV}, }\\ {\ -1 \lt R^{\text{in}}_{b3} \lt 1, }\\ {0.70 \lt C_{H_att}^2 \lt 1.20,}\\ {\ \ 1 \lt v_S \lt 3000\; \text{GeV}. } \end{array}} \end{array}$
(8) For each parameter point, we impose theoretical consistency conditions, including perturbative unitarity, vacuum stability, and perturbativity, as well as current experimental constraints from Higgs, flavor, and electroweak precision observables. The parameter scan is performed using
$ {\textsf{ScannerS}}\_{\textsf{v2.0.0}}$ [90, 91], which interfaces with$ {\textsf{HiggsBounds}}\_{\textsf{v5.10.0}}$ [92, 93],$ {\textsf{HiggsSignals}}\_{\textsf{v2.6.0}}$ [94, 95], and$ {\textsf{SuperIso}}\_{\textsf{v4.1}}$ [96, 97] to evaluate physical observables and apply the relevant constraints [89, 98−104]. This study uses a profiled likelihood ratio test with the SM as the alternative hypothesis, defining$ \Delta\chi^2 = \chi^2_{\mathrm{N2HDM}}-\chi^2_{\mathrm{SM}} $ and requiring$ \Delta\chi^2 \lt 6.18 $ at a 95% confidence level, where$ \chi^2_{\mathrm{N2HDM}} $ and$ \chi^2_{\mathrm{SM}} $ are provided by HiggsSignals, to evaluate the compatibility of the 125 GeV Higgs with the experimental data [95].We further perform a
$ \chi^2_{h_{95}} $ analysis to evaluate the compatibility of the N2HDM-Flipped with the observed excesses in the$ \tau^+\tau^- $ ,$ b\bar{b} $ , and$ \gamma\gamma $ channels. This is not a discovery test but rather a goodness-of-fit check to determine whether the model is compatible with the existing experimental hints.$ \chi^2_{h_{95}} $ is defined as$ \chi^2_{h_{95}} = \sum\limits_{xx=\tau\tau,\,\gamma\gamma,\,bb} \frac{(\mu_{xx} - \mu_{xx}^{\mathrm{exp}})^2}{(\Delta \mu_{xx}^{\mathrm{exp}})^2}, $
(9) where
$ \mu_{xx} $ denotes the model prediction, and$ \mu_{xx}^{\mathrm{exp}} \pm \Delta \mu_{xx}^{\mathrm{exp}} $ are the experimentally measured signal strengths and uncertainties, respectively, as specified in Eq. (1). The uncertainties of the three excesses are treated at the$ 1\sigma $ level, and the requirement$ \chi^2_{h_{95}} \lt 7.82 $ corresponds to consistency with the experimental results at the 95% confidence level for three degrees of freedom, indicating that the model is not excluded by current data within this statistical tolerance.After applying the abovementioned constraints, surviving samples with
$ \chi_{h_{95}}^2<7.82 $ in the$ \mu_{\tau\tau} $ versus$ \mu_{\gamma\gamma} $ plane for$ \sqrt{s} = 210 $ (left) and$ 240 $ (right) GeV are shown in Fig. 5. The CEPC with$ L=100,\; 200,\; 800 {\; {\rm{fb}}}^{-1},\; \mathrm{and}\; 1.22 {\; {\rm{ab}}}^{-1} $ can cover the red, yellow, blue, and green samples at the$ 5\sigma $ level, respectively. All surviving samples can be covered at$ 5\sigma $ for$ \sqrt{s} = 210 $ GeV with$ L=800 {\; {\rm{fb}}}^{-1} $ and$ \sqrt{s} = 240 $ GeV with$ L=1.22 {\; {\rm{ab}}}^{-1} $ . The green and purple regions in Fig. 5 indicate the$ 1\sigma $ excesses in the$ \tau^+\tau^- $ and$ \gamma\gamma $ channels, respectively, as reported by experiments. The best-fit point, marked by a red star, yields$ \chi^2_{h_{95}} = \; 0.24 $ , corresponding to a p-value of 0.971. For this point, a$ 2\sigma $ sensitivity can be achieved with$ L = 10\, {\; {\rm{fb}}}^{-1} $ and$ 5\sigma $ with$ L = 58\, {\; {\rm{fb}}}^{-1} $ at$ \sqrt{s} = 210 $ GeV. At$ \sqrt{s} = 240 $ GeV, the same levels of sensitivity require$ L = 13\, {\; {\rm{fb}}}^{-1} $ and$ L =80\, {\; {\rm{fb}}}^{-1} $ , respectively. Apart from the sensitivity projections in Fig. 5 and the associated discussion, our analysis is independent of the specific selection of$ \mu_{\tau\tau} $ . A smaller consistent$ \mu_{\tau\tau} $ would require a correspondingly higher integrated luminosity to maintain equivalent coverage of the parameter space that is compatible with the current excesses. These results demonstrate that the$ e^+e^- \to Z(\to \mu^+\mu^-) S(\to \tau^+\tau^-) $ channel at the CEPC offers excellent potential to probe the 95 GeV excess in the N2HDM-Flipped framework.
Figure 5. (color online) Coverage ability of the CEPC for surviving samples in the
$ \mu_{\tau\tau} $ versus$ \mu_{\gamma\gamma} $ plane at$ \sqrt{s}=210 $ GeV (left panel) and$ \sqrt{s}=240 $ GeV (right panel). The CEPC with$ L=100,\; 200,\; 800 {\; {\rm{fb}}}^{-1},\; \mathrm{and}\; 1.22 {\; {\rm{ab}}}^{-1} $ can cover the red, yellow, blue, and green samples at the$ 5\sigma $ level, respectively. All surviving samples can be covered at$ 5\sigma $ for$ \sqrt{s} = 210 $ GeV with$ L=800 {\; {\rm{fb}}}^{-1} $ and$ \sqrt{s} = 240 $ GeV with$ L=1.22 {\; {\rm{ab}}}^{-1} $ . The red star marks the best-fit point. The green and purple shaded bands indicate the experimental$ 1\sigma $ ranges for the$ \tau^+\tau^- $ and$ \gamma\gamma $ channels, respectively. -
To demonstrate the broader applicability of our ML approach, we examine the CEPC’s sensitivity to the parameter space of the flipped Next-to-2HDM (N2HDM-Flipped). The same strategy can also be straightforwardly applied to other models with an extended Higgs sector, such as the next-to-minimal supersymmetric model (NMSSM). N2HDM-Flipped extends the SM with a second Higgs doublet and a real singlet, where the Yukawa coupling of charged leptons is ”flipped” from down- (in Type-II N2HDM or NMSSM) to up-quark-like [87−89]. A complete definition of the N2HDM-Flipped is given in Section II of Ref. [24], detailing its particle content, coupling structure, and differences from scenarios of other Types. Ref. [32] shows that the N2HDM-Flipped scenario can accommodate the excesses observed in several channels. The present work further evaluates the CEPC detection potential for the surviving parameter space of this model.
After electroweak symmetry breaking, there are three neutral CP-even states
$ H_{a,b,c} $ , a pseudoscalar A, and a charged pair$ H^\pm $ in the N2HDM-Flipped. We identify$ H_a $ with the 125 GeV SM–like Higgs boson and take$ H_b $ as the putative 95 GeV resonance. 11 new parameters that need to be defined, including the mass of$ H_{a,b,c} $ , A, and$ H^\pm $ ; the ratio of the vacuum expected values (VEV) of two Higgs doublets tanβ; the effective couplings of$ H_a $ to massive gauge bosons ($ C_{H_aVV}^2 $ ) and top-quarks($ C_{H_att}^2 $ ); the mixing matrix elements between$ H_{a,b} $ and the singlet field ($ R_{a3}^{{\mathrm{in}}} $ and$ R_{b3}^{\mathrm{in}} $ ); the soft$ Z_2 $ -breaking parameter$ m_{12}^2 $ ; the singlet VEV$ v_S $ . The 11 independent parameters are scanned in the ranges:$ \begin{array}{*{20}{c}} {\begin{array}{*{20}{l}} {\ \ \ 95 < m_{H_b} < 96\; \text{GeV},}\\ {\ \ 300 < m_{H_{c},A} < 1500\; \text{GeV}, }\\ {\ \ 0.8 < \tan\beta < 10, }\\ {0.70 < C_{H_aVV}^2 < 1.00, }\\ {10^{-3} < m_{12}^2 < 5\times10^{5}\; \text{GeV}^2, } \end{array}}&{\begin{array}{*{20}{l}} {\ \ \ \ \ \text{sign}(R^{\text{in}}_{a3}) = \pm1, }\\ {\ \ 580 < m_{H^\pm} < 1500\; \text{GeV}, }\\ {\ -1 < R^{\text{in}}_{b3} < 1, }\\ {0.70 < C_{H_att}^2 < 1.20,}\\ {\ \ 1 < v_S < 3000\; \text{GeV}. } \end{array}} \end{array}$
(8) For each parameter point, we impose theoretical consistency conditions, including perturbative unitarity, vacuum stability, and perturbativity, as well as current experimental constraints from Higgs, flavor, and electroweak precision observables. The parameter scan is performed using
$ {\textsf{ScannerS}}\_{\textsf{v2.0.0}}$ [90, 91], which interfaces with$ {\textsf{HiggsBounds}}\_{\textsf{v5.10.0}}$ [92, 93],$ {\textsf{HiggsSignals}}\_{\textsf{v2.6.0}}$ [94, 95], and$ {\textsf{SuperIso}}\_{\textsf{v4.1}}$ [96, 97] to evaluate physical observables and apply the relevant constraints [89, 98−104]. This work uses a profiled likelihood ratio test with the SM as the alternative hypothesis, defining$ \Delta\chi^2 = \chi^2_{\mathrm{N2HDM}}-\chi^2_{\mathrm{SM}} $ and requiring$ \Delta\chi^2 < 6.18 $ at 95% confidence level, where$ \chi^2_{\mathrm{N2HDM}} $ and$ \chi^2_{\mathrm{SM}} $ are given by HiggsSignals to evaluate the compatibility of the 125 GeV Higgs with the experimental data [95].We further perform a
$ \chi^2_{h_{95}} $ analysis to evaluate the compatibility of the N2HDM-Flipped with the observed excesses in the$ \tau^+\tau^- $ ,$ b\bar{b} $ , and$ \gamma\gamma $ channels. This is not a discovery test, but rather a goodness-of-fit check to see whether the model is compatible with the existing experimental hints. The$ \chi^2_{h_{95}} $ is defined as$ \chi^2_{h_{95}} = \sum\limits_{xx=\tau\tau,\,\gamma\gamma,\,bb} \frac{(\mu_{xx} - \mu_{xx}^{\mathrm{exp}})^2}{(\Delta \mu_{xx}^{\mathrm{exp}})^2}, $
(9) where
$ \mu_{xx} $ denotes the model prediction and$ \mu_{xx}^{\mathrm{exp}} \pm \Delta \mu_{xx}^{\mathrm{exp}} $ are the experimentally measured signal strengths and uncertainties, respectively, as specified in Eq. (1). The uncertainties of the three excesses are treated at the$ 1\sigma $ level, and for three degrees of freedom the requirement$ \chi^2_{h_{95}} < 7.82 $ corresponds to consistency with the experimental results at the 95% confidence level, indicating that the model is not excluded by current data within this statistical tolerance.After applying the constraints mentioned above, surviving samples with
$ \chi_{h_{95}}^2<7.82 $ are shown in Fig. 5 in the$ \mu_{\tau\tau} $ versus$ \mu_{\gamma\gamma} $ plane$ \sqrt{s} = 210 $ (left) and$ 240 $ (right) GeV. CEPC with$ L=100,\; 200,\; 800 {\; {\rm{fb}}}^{-1},\; \mathrm{and}\; 1.22 {\; {\rm{ab}}}^{-1} $ can cover the red, yellow, blue, and green samples at the$ 5\sigma $ level, respectively. All the surviving samples can be covered at$ 5\sigma $ for$ \sqrt{s} = 210 $ GeV with$ L=800 {\; {\rm{fb}}}^{-1} $ and$ \sqrt{s} = 240 $ GeV with$ L=1.22 {\; {\rm{ab}}}^{-1} $ . The green and purple regions in Fig. 5 indicate the$ 1\sigma $ excesses in the$ \tau^+\tau^- $ and$ \gamma\gamma $ channels, respectively, as reported by experiments. The best-fit point, marked by a red star, yields$ \chi^2_{h_{95}} = \; 0.24 $ , corresponding to a p-value of 0.971. For this point, a$ 2\sigma $ sensitivity can be achieved with$ L = 10\, {\; {\rm{fb}}}^{-1} $ and$ 5\sigma $ with$ L = 58\, {\; {\rm{fb}}}^{-1} $ at$ \sqrt{s} = 210 $ GeV. At$ \sqrt{s} = 240 $ GeV, the same levels of sensitivity require$ L = 13\, {\; {\rm{fb}}}^{-1} $ and$ L =80\, {\; {\rm{fb}}}^{-1} $ , respectively. Apart from the sensitivity projections in Fig. 5 and associated discussion, our analysis is independent of the specific choice of$ \mu_{\tau\tau} $ . A smaller consistent$ \mu_{\tau\tau} $ would require a correspondingly higher integrated luminosity to maintain equivalent coverage of the parameter space compatible with the current excesses. These results demonstrate that the$ e^+e^- \to Z(\to \mu^+\mu^-) S(\to \tau^+\tau^-) $ channel at CEPC offers excellent potential to probe the 95 GeV excess in the N2HDM-Flipped framework.
Figure 5. (color online) The coverage ability of CEPC for surviving samples in the
$ \mu_{\tau\tau} $ versus$ \mu_{\gamma\gamma} $ plane at$ \sqrt{s}=210 $ GeV (left panel) and$ \sqrt{s}=240 $ GeV (right panel). CEPC with$ L=100,\; 200,\; 800 {\; {\rm{fb}}}^{-1},\; \mathrm{and}\; 1.22 {\; {\rm{ab}}}^{-1} $ can cover the red, yellow, blue, and green samples at the$ 5\sigma $ level, respectively. All the surviving samples can be covered at$ 5\sigma $ for$ \sqrt{s} = 210 $ GeV with$ L=800 {\; {\rm{fb}}}^{-1} $ and$ \sqrt{s} = 240 $ GeV with$ L=1.22 {\; {\rm{ab}}}^{-1} $ . The red star marks the best-fit point. The green and purple shaded bands indicate the experimental$ 1\sigma $ ranges for the$ \tau^+\tau^- $ and$ \gamma\gamma $ channels, respectively. -
To demonstrate the broader applicability of our ML approach, we examine the sensitivity of the CEPC to the parameter space of the N2HDM-Flipped. The same strategy can also be straightforwardly applied to other models with an extended Higgs sector, such as the next-to-minimal supersymmetric model (NMSSM). The N2HDM-Flipped extends the SM with a second Higgs doublet and a real singlet, where the Yukawa coupling of charged leptons is ''flipped'' from down- (in the Type-II N2HDM or NMSSM) to up-quark-like [87−89]. A complete definition of the N2HDM-Flipped is provided in Section II of Ref. [24], which details its particle content, coupling structure, and differences from scenarios of other types. Ref. [32] shows that the N2HDM-Flipped scenario can accommodate the excesses observed in several channels. The present study further evaluates the CEPC detection potential for the surviving parameter space of this model.
Following electroweak symmetry breaking, three neutral CP-even states
$ H_{a,b,c} $ , a pseudoscalar A, and a charged pair$ H^\pm $ exist in the N2HDM-Flipped. We identify$ H_a $ using the 125 GeV SM like Higgs boson and consider$ H_b $ as the putative 95 GeV resonance. In total, 11 new parameters need to be defined: the masses of$ H_{a,b,c} $ , A, and$ H^\pm $ ; the ratio of the vacuum expected values (VEV) of two Higgs doublets tanβ; the effective couplings of$ H_a $ to massive gauge bosons ($ C_{H_aVV}^2 $ ) and top-quarks ($ C_{H_att}^2 $ ); the mixing matrix elements between$ H_{a,b} $ and the singlet field ($ R_{a3}^{{\mathrm{in}}} $ and$ R_{b3}^{\mathrm{in}} $ ); the soft$ Z_2 $ -breaking parameter$ m_{12}^2 $ ; and the singlet VEV$ v_S $ . The 11 independent parameters are scanned in the following ranges:$ \begin{array}{*{20}{c}} {\begin{array}{*{20}{l}} {\ \ \ 95 \lt m_{H_b} \lt 96\; \text{GeV},}\\ {\ \ 300 \lt m_{H_{c},A} \lt 1500\; \text{GeV}, }\\ {\ \ 0.8 \lt \tan\beta \lt 10, }\\ {0.70 \lt C_{H_aVV}^2 \lt 1.00, }\\ {10^{-3} \lt m_{12}^2 \lt 5\times10^{5}\; \text{GeV}^2, } \end{array}}&{\begin{array}{*{20}{l}} {\ \ \ \ \ \text{sign}(R^{\text{in}}_{a3}) = \pm1, }\\ {\ \ 580 \lt m_{H^\pm} \lt 1500\; \text{GeV}, }\\ {\ -1 \lt R^{\text{in}}_{b3} \lt 1, }\\ {0.70 \lt C_{H_att}^2 \lt 1.20,}\\ {\ \ 1 \lt v_S \lt 3000\; \text{GeV}. } \end{array}} \end{array}$
(8) For each parameter point, we impose theoretical consistency conditions, including perturbative unitarity, vacuum stability, and perturbativity, as well as current experimental constraints from Higgs, flavor, and electroweak precision observables. The parameter scan is performed using
$ {\textsf{ScannerS}}\_{\textsf{v2.0.0}}$ [90, 91], which interfaces with$ {\textsf{HiggsBounds}}\_{\textsf{v5.10.0}}$ [92, 93],$ {\textsf{HiggsSignals}}\_{\textsf{v2.6.0}}$ [94, 95], and$ {\textsf{SuperIso}}\_{\textsf{v4.1}}$ [96, 97] to evaluate physical observables and apply the relevant constraints [89, 98−104]. This study uses a profiled likelihood ratio test with the SM as the alternative hypothesis, defining$ \Delta\chi^2 = \chi^2_{\mathrm{N2HDM}}-\chi^2_{\mathrm{SM}} $ and requiring$ \Delta\chi^2 \lt 6.18 $ at a 95% confidence level, where$ \chi^2_{\mathrm{N2HDM}} $ and$ \chi^2_{\mathrm{SM}} $ are provided by HiggsSignals, to evaluate the compatibility of the 125 GeV Higgs with the experimental data [95].We further perform a
$ \chi^2_{h_{95}} $ analysis to evaluate the compatibility of the N2HDM-Flipped with the observed excesses in the$ \tau^+\tau^- $ ,$ b\bar{b} $ , and$ \gamma\gamma $ channels. This is not a discovery test but rather a goodness-of-fit check to determine whether the model is compatible with the existing experimental hints.$ \chi^2_{h_{95}} $ is defined as$ \chi^2_{h_{95}} = \sum\limits_{xx=\tau\tau,\,\gamma\gamma,\,bb} \frac{(\mu_{xx} - \mu_{xx}^{\mathrm{exp}})^2}{(\Delta \mu_{xx}^{\mathrm{exp}})^2}, $
(9) where
$ \mu_{xx} $ denotes the model prediction, and$ \mu_{xx}^{\mathrm{exp}} \pm \Delta \mu_{xx}^{\mathrm{exp}} $ are the experimentally measured signal strengths and uncertainties, respectively, as specified in Eq. (1). The uncertainties of the three excesses are treated at the$ 1\sigma $ level, and the requirement$ \chi^2_{h_{95}} \lt 7.82 $ corresponds to consistency with the experimental results at the 95% confidence level for three degrees of freedom, indicating that the model is not excluded by current data within this statistical tolerance.After applying the abovementioned constraints, surviving samples with
$ \chi_{h_{95}}^2<7.82 $ in the$ \mu_{\tau\tau} $ versus$ \mu_{\gamma\gamma} $ plane for$ \sqrt{s} = 210 $ (left) and$ 240 $ (right) GeV are shown in Fig. 5. The CEPC with$ L=100,\; 200,\; 800 {\; {\rm{fb}}}^{-1},\; \mathrm{and}\; 1.22 {\; {\rm{ab}}}^{-1} $ can cover the red, yellow, blue, and green samples at the$ 5\sigma $ level, respectively. All surviving samples can be covered at$ 5\sigma $ for$ \sqrt{s} = 210 $ GeV with$ L=800 {\; {\rm{fb}}}^{-1} $ and$ \sqrt{s} = 240 $ GeV with$ L=1.22 {\; {\rm{ab}}}^{-1} $ . The green and purple regions in Fig. 5 indicate the$ 1\sigma $ excesses in the$ \tau^+\tau^- $ and$ \gamma\gamma $ channels, respectively, as reported by experiments. The best-fit point, marked by a red star, yields$ \chi^2_{h_{95}} = \; 0.24 $ , corresponding to a p-value of 0.971. For this point, a$ 2\sigma $ sensitivity can be achieved with$ L = 10\, {\; {\rm{fb}}}^{-1} $ and$ 5\sigma $ with$ L = 58\, {\; {\rm{fb}}}^{-1} $ at$ \sqrt{s} = 210 $ GeV. At$ \sqrt{s} = 240 $ GeV, the same levels of sensitivity require$ L = 13\, {\; {\rm{fb}}}^{-1} $ and$ L =80\, {\; {\rm{fb}}}^{-1} $ , respectively. Apart from the sensitivity projections in Fig. 5 and the associated discussion, our analysis is independent of the specific selection of$ \mu_{\tau\tau} $ . A smaller consistent$ \mu_{\tau\tau} $ would require a correspondingly higher integrated luminosity to maintain equivalent coverage of the parameter space that is compatible with the current excesses. These results demonstrate that the$ e^+e^- \to Z(\to \mu^+\mu^-) S(\to \tau^+\tau^-) $ channel at the CEPC offers excellent potential to probe the 95 GeV excess in the N2HDM-Flipped framework.
Figure 5. (color online) Coverage ability of the CEPC for surviving samples in the
$ \mu_{\tau\tau} $ versus$ \mu_{\gamma\gamma} $ plane at$ \sqrt{s}=210 $ GeV (left panel) and$ \sqrt{s}=240 $ GeV (right panel). The CEPC with$ L=100,\; 200,\; 800 {\; {\rm{fb}}}^{-1},\; \mathrm{and}\; 1.22 {\; {\rm{ab}}}^{-1} $ can cover the red, yellow, blue, and green samples at the$ 5\sigma $ level, respectively. All surviving samples can be covered at$ 5\sigma $ for$ \sqrt{s} = 210 $ GeV with$ L=800 {\; {\rm{fb}}}^{-1} $ and$ \sqrt{s} = 240 $ GeV with$ L=1.22 {\; {\rm{ab}}}^{-1} $ . The red star marks the best-fit point. The green and purple shaded bands indicate the experimental$ 1\sigma $ ranges for the$ \tau^+\tau^- $ and$ \gamma\gamma $ channels, respectively. -
Our study identifies
$ \sqrt{s}=210 $ GeV as the optimal center-of-mass energy to study the hypothetical scalar with 95 GeV or a slightly higher masses. Without ML techniques, the benchmark point requires integrated luminosities of$ 480\,(690) \ {\rm{fb}} ^{-1} $ to reach$ 5\sigma $ significance at$ \sqrt{s}=210\,(240) $ GeV. A DNN classifier can reduce these numbers to$ 210\,(310) \ {\rm{fb}} ^{-1} $ at the corresponding energies. At$ L = 20\; \mathrm{ab}^{-1} $ , the CEPC's$ 5\sigma $ sensitivity to the signal strength$ \mu_{\tau\tau}^{ZS} $ reaches 0.016 and 0.020 for$ \sqrt{s} = 210 $ GeV and 240 GeV, respectively. The corresponding thresholds for a 5% precision measurement are$ \mu_{\tau\tau}^{ZS} > 0.10 $ and$ >0.12 $ . At$ \sqrt{s}=210 $ GeV (240 GeV),$ 5\sigma $ coverage of all N2HDM-Flipped samples with$ \chi^2_{h_{95}}<7.82 $ requires$ L=800\ \mathrm{fb}^{-1} $ (1.22$ \mathrm{ab}^{-1} $ ). These results suggest that an early CEPC run at 210 GeV, combined with modern machine-learning selection, offers the most efficient strategy for probing the 95 GeV excess. The analysis framework can be readily adapted to alternative lepton colliders such as the ILC and FCC-ee, and adapted to other light-scalar scenarios. -
Our study has identified
$ \sqrt{s}=210 $ GeV as the optimal center-of-mass energy to investigate a hypothetical scalar with a mass of 95 GeV or slightly higher. Without ML techniques, the benchmark point requires integrated luminosities of$ 480\,(690) \ {\rm{fb}} ^{-1} $ to reach a$ 5\sigma $ significance at$ \sqrt{s}=210\,(240) $ GeV. A DNN classifier can reduce these values to$ 210\,(310) \ {\rm{fb}} ^{-1} $ at the corresponding energies. At$ L = 20\; \mathrm{ab}^{-1} $ , the$ 5\sigma $ sensitivity of the CEPC to the signal strength$ \mu_{\tau\tau}^{ZS} $ reaches 0.016 and 0.020 for$ \sqrt{s} = 210 $ GeV and 240 GeV, respectively. The corresponding thresholds for a 5% precision measurement are$ \mu_{\tau\tau}^{ZS} \gt 0.10 $ and$ >0.12 $ . At$ \sqrt{s}=210 $ GeV (240 GeV),$ 5\sigma $ coverage of all N2HDM-Flipped samples with$ \chi^2_{h_{95}}<7.82 $ requires$ L=800\ \mathrm{fb}^{-1} $ (1.22$ \mathrm{ab}^{-1} $ ). These results suggest that an early CEPC run at 210 GeV, combined with modern ML selection, offers the most efficient strategy for probing the 95 GeV excess. The analysis framework can be readily adapted to alternative lepton colliders such as the ILC and FCC-ee and adjusted to other light-scalar scenarios. -
Our study has identified
$ \sqrt{s}=210 $ GeV as the optimal center-of-mass energy to investigate a hypothetical scalar with a mass of 95 GeV or slightly higher. Without ML techniques, the benchmark point requires integrated luminosities of$ 480\,(690) \ {\rm{fb}} ^{-1} $ to reach a$ 5\sigma $ significance at$ \sqrt{s}=210\,(240) $ GeV. A DNN classifier can reduce these values to$ 210\,(310) \ {\rm{fb}} ^{-1} $ at the corresponding energies. At$ L = 20\; \mathrm{ab}^{-1} $ , the$ 5\sigma $ sensitivity of the CEPC to the signal strength$ \mu_{\tau\tau}^{ZS} $ reaches 0.016 and 0.020 for$ \sqrt{s} = 210 $ GeV and 240 GeV, respectively. The corresponding thresholds for a 5% precision measurement are$ \mu_{\tau\tau}^{ZS} \gt 0.10 $ and$ >0.12 $ . At$ \sqrt{s}=210 $ GeV (240 GeV),$ 5\sigma $ coverage of all N2HDM-Flipped samples with$ \chi^2_{h_{95}}<7.82 $ requires$ L=800\ \mathrm{fb}^{-1} $ (1.22$ \mathrm{ab}^{-1} $ ). These results suggest that an early CEPC run at 210 GeV, combined with modern ML selection, offers the most efficient strategy for probing the 95 GeV excess. The analysis framework can be readily adapted to alternative lepton colliders such as the ILC and FCC-ee and adjusted to other light-scalar scenarios.
Testing a 95 GeV Scalar at the CEPC with Machine Learning
- Received Date: 2025-08-18
- Available Online: 2026-03-15
Abstract: Several possible excesses around $ 95 $ GeV hint at an additional light scalar beyond the Standard Model. We examine the capability of the Circular Electron-Positron Collider (CEPC) to test this hypothesis in the Higgs-strahlung channel $ e^{+}e^{-} \to ZS $ with $ Z \to\mu^{+}\mu^{-} $ and $ S \to\tau^{+}\tau^{-} $. A full detector simulation shows that the optimal center-of-mass energy for studying the 95 GeV light scalar is 210 GeV. A deep neural network classifier reduces the luminosity required for discovery by half. At $ L = 20\; \mathrm{ab}^{-1} $, the $ 5\sigma $ sensitivity of the CEPC to the signal strength $ \mu_{\tau\tau}^{ZS} $ reaches 0.016 and 0.020 for $ \sqrt{s} = 210 $ GeV and 240 GeV, respectively. The corresponding thresholds for a 5% precision measurement are $ \mu_{\tau\tau}^{ZS} \gt 0.10 $ and $ >0.12 $. At $ \sqrt{s}=210 $ GeV (240 GeV), the $ 5\sigma $ coverage of all N2HDM-Flipped samples with $ \chi^2_{h_{95}}<7.82 $ requires $ L=800\ \mathrm{fb}^{-1} $ (1.22 $ \mathrm{ab}^{-1} $). These results establish a 210 GeV run, augmented by machine-learning selection, as the most efficient strategy to confirm or refute the 95 GeV excess at future lepton colliders.





Abstract
HTML
Reference
Related
PDF












DownLoad: