SysteMHC v.231201
Home Datasets Download Help About Update

About

The SysteMHC Atlas project is part of the Human Immuno-Peptidome Project (HIPP). HIPP was launched in 2015 as a new initiative of the Human Proteome Organization (HUPO). The long-term goal of the HIPP is to map the entire repertoire of peptides presented by MHC molecules using mass spectrometry technologies and make its robust analysis accessible to any immunologist. Toward this end, HIPP plans to embrace partnerships, prioritize technology development and maximize data sharing. The SysteMHC Atlas project supports this latter goal and this is an updated one (Second Version) .

Overview of the build process of SysteMHC Atlas.

Overview of the updated SysteMHC build process:

Following are the key parameters:

General Parameters
ParameterValue
Precursor tolerance10 ppm
High accuracy fragment ion tolerance (Orbi/TOF/QE)10 ppm
Low accuracy fragment ion tolerance0.5Da
Digestion specificityunspecific
Open Search Parameters
Delta massNameDescription
-105.0248Met-loss+acetaldehydeMet-loss+acetaldehyde
-89.0299Met-loss+AcetylMet-loss+Acetylation
-33.9877Cys->DhaDehydroalanine
-32.0085Met->AspSAMethionine-oxidation-to-aspartic-semialdehyde
-30.0106Pro->PyrrolidinoneProline-oxidation-to-pyrrolidinone
-29.9928Met->HseHomoserine
-18.0106Glu->pyro-GluGlu->pyro-Glu
-17.0265Gln->pyro-GluGln->pyro-Glu
-2.0157Didehydrosecond-amino-3-oxo-butanoic_acid
-1.9979Met->GluMet->Glu substitution
-1.9615Cys->ThrCys->Thr substitution
-0.984AmidationAmidation
+0.984DeamidatedDeamidation
+3.9949Trp->Kynurenintryptophan-oxidation-to-kynurenin
+12.0Thiazolidineformaldehyde-adduct
+13.9793Trp->OxolactoneTryptophan-oxidation-to-oxolactone
+14.0157MethylMethylation
+15.9949OxidationOxidation or Hydroxylation
+19.9898Trp->Hydroxykynurenintryptophan-oxidation-to-hydroxykynurenin
+21.9694Cation-MgReplacement-of-2-protons-by-magnesium
+21.9819Cation-NaSodium-adduct
+23.9581Cation-AlReplacement-of-3-protons-by-aluminium
+26.0157Delta-H(2)C(2)Acetaldehyde+26
+27.9949FormylFormylation
+28.0313Dimethyldi-Methylation
+28.9902Nitrosylnitrosylation
+29.9742quinoneCquinone
+30.0106hydroxymethylhydroxymethyl
+31.9721persulfidepersulfide
+31.9898dihydroxydihydroxy
+37.9469Cation-CaReplacement-of-2-protons-by-calcium
+37.9558Cation-KReplacement-of-proton-by-potassium
+42.0106AcetylAcetylation
+42.047Trimethyltri-Methylation
+43.0058CarbamylCarbamylation
+43.9898CarboxyCarboxylation
+44.9851NitroOxidation-to-nitro
+47.9847Trioxidationcysteine-oxidation-to-cysteic-acid
+53.9193Cation-FeReplacement of 2 protons by iron
+58.0055CarboxymethylIodoacetic acid derivative
+68.0262CrotonylCrotonylation
+70.0418CrotonaldehydeCrotonaldehyde
+79.9568SulfoO-Sulfonation
+79.9663PhosphoPhosphorylation
+80.0374Gly->HisGly->His
+86.0004MalonylationMalonylation
+100.016SuccinylSuccinic-anhydride-labeling-reagent-light-form(N-term&K)
+119.0041CysteinylCysteinylation
+119.0371pyridylacetylpyridylacetyl
+146.0579FucosylationFucose
+162.0528HexHexose
+176.0321Glucuronylhexuronic acid
+178.0477GluconoylationGluconoylation
+183.0354AEBSAminoethylbenzenesulfonylation
+203.0794HexNAcN-Acetylhexosamine
+204.1878FarnesylationFarnesylation
+210.1984MyristoylationMyristoylation
+229.014Pyridoxal PhosphatePyridoxal phosphate
+238.2297PalmitoylationPalmitoylation
+340.1006Glucosylgalactosylglucosylgalactosyl hydroxylysine
+349.1373HexNAc1dHex1HexNAc1dHex1
+365.1322Hex1HexNAc1Hex1HexNAc1
+406.1587HexNAc2HexNAc2
+541.0611ADP-RibosylADP Ribose addition

In general, the computational pipeline was based on Trans-proteomic Pipeline version 6.0.0 -- We used msConvert to convert mass spectrometric raw files into mzML/mzXML files with default settings. And three search engines (i.e. Comet, MSFragger and MSGF+) were used to perform (closed/offset) database searches with a target-decoy strategy. The instrument-specific search parameters were described above. For statistical validation, PeptideProphet was first applied with the accurate mass model enabled. Then, iProphet was used to combine the outputs of PeptideProphet from all three search engines.

To predict the binding affinity, NetMHCpan-4.1 and MixMHCpred-2.2 were used for MHC Class I peptides with the default settings, and NetMHCIIpan-4.1 and MixMHC2pred-2.0 were used for MHC class II peptides. The Motifs of predicted binders of Class I and II were deconvoluted by GibbsCluster-2.0 and MoDec-1.2, respectively. Additionally, we used MixMHCp-2.1 and MoDec-1.2 to perform direct motif deconvolution for the MHC class I immunopeptidome and MHC class II immunopeptidome, respectively.

The Annotation score for the peptides annotated for the allele is calculated using the strategy described in Caron et al. eLife 2015, based on the binding affinity prediction by NetMHCpan and NetMHCIIpan. Specificlly, it is calculated by dividing the second lowest IC50 value (second best predicted allele) by the lowest IC50 value (best predicted allele).

At a 1% peptide level FDR estimated by iProphet, spectral libraries were constructed by SpectraST with default settings for consensus library building. For each sample, a sample-specific consensus spectral library was generated. For each allele, an allele-specific spectral library was generated on the atlas level that contains consensus spectra of peptides from different samples ranging from diverse tissue types and disease types. In the allele-specific spectral library, the same peptide ions generated under various fragmentation methods were specified and kept separated as different library entries. For each PTM, a PTM-specific spectral library was generated.

The current build version is 230601.

How to cite SysteMHC Atlas

Please acknowledge the SysteMHC Atlas in your publications by citing the following manuscript:

Shao, W.; Pedrioli, P.G.A. et al. The SysteMHC Atlas project. Nucleic Acids Res doi:10.1093/nar/gkx664

Huang, X.; Gan, Z. The SysteMHC Atlas v2.0, an updated resource for mass spectrometry-based immunopeptidomics. Nucleic Acids Res doi:10.1093/nar/gkad1068

The computational pipeline

The computational pipeline SysteMHC-pipeline is provided by GitHub repository.

The team behind SysteMHC Atlas:

Licenses

SysteMHC Atlas is made available under the Open Database License: http://opendatacommons.org/licenses/odbl/1.0/.

Any rights in individual contents of the database are licensed under the Database Contents License: http://opendatacommons.org/licenses/odbl/1.0/.

Contact us: