Skip to main content

Immunoinformatics and analysis of antigen distribution of Ureaplasma diversum strains isolated from different Brazilian states



Ureaplasma diversum has numerous virulence factors that contribute to pathogenesis in cattle, including Lipid-associated membrane proteins (LAMPs). Therefore, the objectives of this study were to evaluate in silico important characteristics for immunobiological applications and for heterologous expression of 36 LAMPs of U. diversum (UdLAMPs) and, also, to verify by conventional PCR the distribution of these antigens in strains of Brazilian states (Bahia, Minas Gerais, São Paulo, and Mato Grosso do Sul). The Manatee database was used to obtain the gene and peptide sequences of the antigens. Similarity and identity studies were performed using BLASTp and direct antigenicity was evaluated by the VaxiJen v2.0 server. Epitope prediction for B lymphocytes was performed on the BepiPred v2.0 and CBTOPE v1.0 servers. NetBoLApan v1.0 was used to predict CD8+ T lymphocyte epitopes. Subcellular location and presence of transmembrane regions were verified by the software PSORTb v3.0.2 and TMHMM v2.2 respectively. SignalP v5.0, SecretomeP v2.0, and DOLOP servers were used to predict the extracellular excretion signal. Physico-chemical properties were evaluated by the web-software ProtParam, Solpro, and Protein-sol.


In silico analysis revealed that many UdLAMPs have desirable properties for immunobiological applications and heterologous expression. The proteins gudiv_61, gudiv_103, gudiv_517, and gudiv_681 were most promising. Strains from the 4 states were PCR positive for antigens predicted with immunogenic and/or with good characteristics for expression in a heterologous system.


These works contribute to a better understanding of the immunobiological properties of the UdLAMPs and provide a profile of the distribution of these antigens in different Brazilian states.


U. diversum, a member of the Mollicutes class, is a bovine pathogen related to reproductive disorders [1]. This agent presents the following outstanding characteristics: the production of ammonia, through urea hydrolysis, and the absence of a cell wall [2]. Although U. diversum infection is not conditioned by the presence of clinical symptoms, it can colonize the respiratory and genital/reproductive systems of cattle, generating severe inflammatory conditions often culminating in abortion [3]. It is considered an opportunistic pathogen found in the mucosa and secretions of the vulva, vagina, and udder of cows and secretion of the respiratory tract of calves [1].

Milk production in cows and spermatogenesis in bulls are also affected. U. diversum produces mastitis along with visible changes in the milk and udder [4]. In bulls, it causes seminal vesiculitis, balanoposthitis, epididymitis, and morphological and functional changes in sperm. Thus, U. diversum colonizes different regions of the reproductive system leading to active semen contamination [5]. Infection of semen for artificial insemination and in vitro fertilization results in serious obstacles to modern bovine reproduction techniques [6].

In addition to urease, U. diversum has sophisticated virulence mechanisms, including LAMPs, a mixture of mycoplasmic lipoproteins expressed on the cell surface that interact directly with host cells. These antigens are considered the main molecular agents associated with pathogens in several Mollicutes species and play an important role in host pathogenicity and immunomodulation [7]. In addition to lipoproteins, in the bovine ureaplasma genome, our research group identified genes encoding the multiple band antigen (MBA), which contain multiple series repetitions in the C-terminal region, as well as the gene for hemolysin and for the Mycoplasma Ig binding protein (MIB) and Mycoplasma Ig protease (MIP) –MIB-MIP system-, which acts by binding and cleaving the IgG heavy chain [2, 8].

The genomic sequencing of a species offers researchers new possibilities for research. Rapid analysis of all or part of the genome allows the construction of primers and screening of genes coding for virulence factors in the most diverse bacterial strains. The use of immunoinformatics tools allows screening with a high level of reliability of the physico-chemical and immunological properties of these molecules with low cost and reliable results [9]. The use of recombinant DNA technology can, through expression in a heterologous system, allow the analysis of virulence factors alone. Therefore, the objective of this work was to evaluate antigens of U. diversum regarding immunobiological properties and desirable characteristics for expression in a heterologous system, as well as to evaluate the distribution of these antigens in isolates from different regions of Brazil.


U. diversum antigens have low similarity with bovine proteome proteins

BLASTp analyses of the 36 UdLAMPs with bovine proteomes revealed that the maximum similarity occurred between the lipoprotein gudiv_159 and the Tinken-1 protein from Bos taurus taurus (29%). Bos taurus indicus had a similarity detected only for gudiv_517 (10%). The hybrid showed no significant similarity to any protein (Table 1).

Table 1 Analysis of similarity between sequences of 36 UdLAMPs and proteomes of bovine subspecies (Bos taurus taurus, Bos taurus indicus and the hybrid Bos taurus x Bos indicus) performed using the BLASTp tool

In silico analysis showed that U. diversum antigens have epitopes for B and T lymphocytes

Conformational and linear B cell epitopes were evaluated for the number of regions and the total percentage of amino acids in epitope regions. All proteins showed conformational epitopes for B lymphocytes. The most significant B cell epitopes are listed in Additional Table 1. The number of antigenic regions ranged from 2 in gudiv_388 to 124 in gudiv_398. The proteins with the lowest and highest percentage of amino acids in antigenic regions were gudiv_164 (4.3%) and gudiv_66 (39.9%) respectively (Table 2). Except for 10 proteins (gudiv_546, gudiv_457, gudiv_427, gudiv_442, gudiv_388, gudiv_357, gudiv_331, gudiv_228, gudiv_171 and gudiv_159), all the others have a number of predicted regions greater than or equal to the values for surface protein 5 (Msp5) from Anaplasma marginale (Table 3). In the prediction of linear epitopes the number of antigenic regions varied from 1 in gudiv_159 to 84 in gudiv_398. The protein with the highest percentage of amino acids in antigenic regions was gudivi_179 (90.4%). Thirty proteins had number of antigenic regions greater than or equal to Msp5. Eighteen of the 36 UdLAMPs were predicted to be antigenic (score greater than or equal to 0.5 on the VaxiJen server).

Table 2 Prediction of antigenicity and discontinuous and continuous B lymphocyte epitopes of each UdLAMP by the predictors CBTOPE v1.0, BepiPred v2.0, and VaxiJen v2. 0
Table 3 Prediction of binding of UdLAMPs (peptide windows with 9 amino acids) to different BoLA alleles (MHCI) performed through the NetBoLApan v1.0 server. The total of strong and weak connections is expressed in absolute numbers

In the prediction for major histocompatibility complex class I (MHCI) ligand, with the exception of gudiv_85 and gudiv_159, all other lipoproteins showed at least one predicted link for 4 of the 8 MHCI alleles bovine lymphocyte antigen (BoLA) studied (Table 3). Epitopes with strong binding in each BoLA allele are listed in Additional Tables 2 and 3. The maximum number of bonds was between the epitopes of the gudiv_398 protein and the BoLA-2 *01201 allele (75 bonds). Only three U. diversum antigens (gudiv_85, gudiv_331, and gudiv_388) had fewer connections than the Theileria parva 2 antigen (Tp2) in all alleles, of these, gudiv_85 did not show predicted connections in any allele (Table 3).

Some UdLAMPs have low identity compared to proteomes of other Mollicutes

The identity analysis of UdLAMPs with proteomes of other Mollicutes (Mycoplasma bovis, Mycoplasma canadense, Mycoplasma bovigenitalium, Mycoplasma bovirhinis and Mycoplasma díspar) revealed that only 8 proteins (gudiv_103, gudiv_159, gudiv_171, gudiv_228, gudiv_517, gudiv_546, gudiv_680, gudiv_681) did not present a significant identity with the analyzed proteomes. Twenty-four proteins showed an identity greater than 30% (Table 4).

Table 4 BLASTp identity analysis of 36 UdLAMPs with proteomes of M. bovis, M. canadense, M. bovigenitalium, M. bovirhinis and M. dispar

Some UdLAMPs have characteristics for heterologous expression in Escherichia coli

Parameters such as molecular weight (PM), instability index, aliphatic index, grand average of hydropathy (GRAVY), and solubility were predicted for U. diversum antigens. The protein PM varied between 9.0 and 240.2 (kilodalton) kDa. The proteins with the highest molecular weight were gudiv_398 (240.2 kDa), gudiv_162 (90.5 kDa) and gudiv_180 (88.7 kDa), while with lower molecular weight were gudiv_159, gudiv_85, and gudiv_331 with 13.3; 9.4 and 9.0 kDa (Table 5). The instability rates ranged from 9.16 (gudiv_499) to 67.15 (gudiv_331). In general, when this index is less than 40, proteins are considered stable; therefore, in this study, only 4 proteins (gudiv_93, gudiv_159, gudiv_331, and gudiv_560) were classified as unstable according to the prediction. To assess hydrophobicity, GRAVY was studied, GRAVY positive proteins were only gudiv_91, gudiv_228, gudi_357, and gudivi_546 with values​ of 0.05; 0.12; 0.61 and 0.05, respectively. As for solubility, the proteins gudiv_91, gudiv_171, gudiv_287, gudv_357, gudiv_458, and gudiv_560 were insoluble in both Protein-Sol and SOLpro. Gudiv_91 and gudiv_357 also presented 4 and 7 transmembrane loops, respectively (Table 5). In total, sixteen proteins were predicted to be soluble in the two predictors (Table 5).

Table 5 Prediction of physicochemical properties of UdLAMPs. Aliphatic index, PM, GRAVY, and instability index obtained in ProtParam. The solubility was predicted using the server SOLpro and Protein-Sol

A considerable number of UdLAMPs have a signal for excretion by the classical and non-classical pathways

The analysis of classical secretion mediated by signal peptide (SP) was performed by SignalP5. This server predicted SP in 29 of the 36 proteins studied. The size of the SPs ranged from 18 to 29 amino acids and all showed a cleavage site for peptidase II (sec / SPII). A cysteine immediately after the cleavage site can be seen in the predicted SPs (Table 6). The DOLOP server, which uses a series of criteria to predict bacterial lipoprotein SPs, including the preferred occurrence of amino acids, ranked 17 of the 29 proteins predicted by SignalP with typical SP lipoprotein carriers. Of the twenty-nine proteins predicted with the presence of SPs by SignalP, twenty-five also showed a prediction of non-classical excretion when submitted to the predictor SecretomeP (non-signal peptide-mediated excretion). In addition, some proteins (gudiv_61, gudiv_93, gudiv_162, gudiv_164, gudiv_179, gudiv_287, gudiv_331, gudiv_388, gudiv_546, gudiv_633, and gudiv_663) not discriminated as having SP for lipoproteins by DOLOP were predicted to be secreted by non-classic pathways (Table 6).

Table 6 Prediction of classical secretion of the signal peptide performed in SignalP v5.0 and DOLOP. Prediction of non-classical secretion by SecretomeP v2.0. Subcellular location and number of transmembrane loops predicted in PSORTb v3.0.2 and TMHMM v2.0

The prediction analysis reveals that UdLAMPs have important characteristics both for immunobiological applications and for expression in a heterologous system

The antigens of U. diversum have been classified according to undesirable properties for use in prophylactic and immunodiagnostic measures; and undesirable properties for expression in E. coli. The proteins gudiv_61, gudiv_103, gudiv_517, and gudiv_681 passed in all parameters, not being retained in any exclusion criteria established in this study Fig. 1. In addition, a considerable number of UdLAMPs were retained in only one or none of the exclusion criteria.

Fig. 1

Distribution of U. diversum antigens according to the prediction parameters. In red, the proteins included in the evaluated parameter and in white those not included. In undesirable parameters for use in prophylactic and immunodiagnostic measures, the relevant parameters for inducing the production of specific antibodies and positive immunomodulation are evaluated. In undesirable parameters for expression in E. coli, predicted parameters related to the production of stable, soluble, secreted proteins and with properties that facilitate the purification process after expression were evaluated

Gene coding sequences (CDS) for LAMPs predicted as antigenic are present in strains from different Brazilian states

To verify the distribution of U. diversum antigens in different Brazilian states, the presence of genes for LAMPs in 46 U. diversum strains was investigated by PCR. Table 7 lists the primers constructed. All antigens were detected in strain ATCC 49782. The lowest and highest percentage of amplified antigens (not considering the ATCC strain) occurred for strains S8 and 59, respectively, 5.6 and 83.3% (Fig. 2). Regarding antigens, the highest prevalence was gudiv_759, gudiv_357, and gudiv_91 detected in 87, 84.8, and 82.6% of the strains, respectively. In contrast, the least present were gudiv_402 (2.2%) and gudiv_458 (4.3%). The presence of antigens varied in the strains isolated from the states studied (Fig. 3). In Bahia, the state with the highest number of strains, a total of 35 antigens were detected by PCR. The only strain in Minas Gerais tested positive for seven proteins. Isolated representatives of Mato Grosso do Sul (805 and 9653) had 27 antigens. In São Paulo, all 13 strains were PCR positive for 34 proteins.

Table 7 Primers for amplifying UdLAMPs using conventional PCR
Fig. 2

Distribution of U. diversum antigens in 46 strains, one standard strain (ATCC) and 45 isolates from different Brazilian regions. The diagram shows detection by PCR of the antigens in each strain (in green PCR positive and in red negative). On the bottom is the total percentage of antigens that each strain carries in its genome (based on PCR results) and on the left is the percentage of strains carrying the coding sequence for each antigen individually

Fig. 3

Distribution of U. diversum antigens in different Brazilian states. The Brazilian states studied are represented in highlighted colors. In each state, the isolated strains and the set of UdLAMPs positive by PCR are used. In bold and underlined are UdLAMPs that (by prediction) showed good characteristics for immunological studies and expression in E. coli, not being retained in any of the exclusion criteria in Diagram 1. Only in bold, the antigens that, although they did not pass all criteria, were maintained in only one or two of the undesirable parameters for use in prophylactic and immunodiagnostic measures in Fig. 1. The figure was acquired at Wikimedia Commons ( and was adapted using Adobe Photoshop CS6 version 13.0.6 × 64


Mollicutes lipoproteins are important virulence factors associated with pathogenesis in the reproductive and respiratory tract of infected hosts [10]. In this study, the lipoprotein gudiv_159 had 29% similarity with the Tektin-1 protein from the Bos taurus taurus proteome. For the other UdLAMPs and proteins from other bovine subspecies, all similarity values were less than 12%. Similarity values greater than 25% are relevant when assessing immunological aspects [11]. The similarity between virulence factors and host proteins can make it difficult to develop an adequate immune response, or even generate cross-reaction events with autoantibody production during infection [12]. Mycoplasma hominis, M. fermentans and M. arthitides are species of Mollicutes often found in patients with autoimmune diseases [10].

A protective immune response with the production of effector cells and antibodies able to recognize epitopes of an infectious agent are essential for fighting infection. Conformational epitopes represent the majority of B cell epitopes (about 90%). However, conformational epitopes usually contain one or a few stretches of linear epitopes [13]. In the prediction, we found that all 36 UdLAMPs have conformational and linear epitopes for B lymphocytes and are predicted as antigenic (VaxiJen predictor). A considerable number of regions of conformational and linear epitopes were greater than or equal to the values for Msp5, one of the main surface proteins of A. marginale, known for its ability to induce antibody production during cattle infection [14]. The presence of these epitopes points to these molecules as agents capable of stimulating the development of a humoral immunological response.

U. diversum can also behave as an optional intracellular pathogen [15]. Thus, the possibility of UdLAMPs being processed and presented via MHCI can lead to cellular response activation. In this study, epitopes binding to bovine MHCI alleles were predicted in several UdLAMPs. Furthermore, 33 LAMPs had connections equal to or greater than the T. parva Tp2 antigen in all studied alleles. Tp2 is recognized for stimulating CD8+ T cells during bovine T. parva infection [16]. The studied alleles represent cattle destined for the different livestock sectors. Five alleles representing Bos taurus taurus (BoLA-6 * 01301, BoLA-2 * 01201, BoLA-3 * 00201, BoLA-1 * 02301 and BoLA-6 * 04101), two alleles representing Bos taurus indicus (Bola - T5, BoLA-3 * 00101) and an allele (BoLA-T2C) belongs to a hybrid [17]. Taurine breeds are predominantly found on dairy farms and Zebu cattle are mostly used for meat production [18]. Bovine hybrids are usually produced to align the commercial and management characteristics of both subspecies [19]. In this case, our prediction data reveal that a considerable number of UdLAMPs can interact with MHCI alleles of cattle destined for different activities in the livestock sector, reflecting in activation of inactivation immune response.

The identity analysis of UdLAMPs with proteomes of other microorganisms capable of infecting cattle is a useful initial approach for studies aimed at using these antigens or antibodies produced in immunodetection tests. We found that the proteins gudiv_103, gudiv_159, gudiv_171, gudiv_228, gudiv_517, gudiv_546, gudiv_680, and gudiv_681 did not present a significant identity with the proteins of other important Mollicutes that infect bovine. In contrast, 25 proteins showed an identity greater than 30%. According to Rost [20] above a cutoff point of 30% identity, 90% of the pairs are homologous. The low identity between proteins of different infectious agents from the same host is related to good specificity when considering detection tests [21]. Thus, U. diversum proteins with low identity may represent specific targets for use in immunodiagnostic techniques in detecting this pathogen.

In addition to the prediction of immunobiological properties, the prediction of properties favorable to expression in a heterologous system can contribute to the broad scale of a protein biological target. Some physicochemical properties influence the state of solubility, the formation of inclusion bodies or proteolysis of the heterologous peptide [22]. In this study, the protein PM ranged from 9.0 to 240.2 kDa. Proteins with PM between 70 and 60 kDa are well tolerated when E. coli is used as an expression system; however, proteins with very high PM are not adequately expressed in these bacteria, and are, therefore, degraded or structured in the form of inclusion bodies [23]. Small peptides (about 10 kDa) are also difficult to express in stable form due to improper folding, so they are often subject to proteolytic degradation [24].

Our analyses also showed that only gudiv_93, gudiv_159, gudiv_331 and gudiv_560 had an instability index greater than 40 and, therefore, all the others (with an index below 40) were considered to be stable [25]. Most of the proteins were GRAVY negative, which is related to hydrophilicity [26]. Greater hydrophilicity implies a greater capacity to form hydrogen bonds with water molecules and, consequently, greater solubility [27]. Sixteen proteins were predicted to be soluble in the two predictors used in this work (Solpro and proteinSol) and only two proteins had more than two predicted transmembrane loops. Transmembrane loops are hydrophobic regions that reduce solubility [28]. Expression in the soluble form is desirable, because to obtain soluble proteins from insoluble forms, a series of processing steps that involves the use of strong denaturants followed by renaturation is inevitable [29]. Even so, these additional steps do not guarantee the production of soluble and functional proteins.

The presence of specific markers capable of directing heterologous peptides to the extracellular medium in an expression system also contributes to the subsequent steps in the production of recombinant proteins [23, 30, 31]. Here, we show that more than half of the studied proteins were predicted to possess a SP recognized by sec/SPII and consequently likely a lipoprotein capable of being expressed and exported to the extracellular medium by E. coli. The presence of a SP for a classical secretory pathway or markers for secretion by a non-classical pathway facilitates the transport and secretion of the transcript into the extracellular compartment. Secretion in the extracellular medium simplifies purification processes, protects heterologous proteins from proteolysis, decreases endotoxin levels, and improves biological activity and solubility [32].

Bacterial proteins with good properties both for stimulating the immune response and for cloning and expression in a heterologous system are desirable targets for biotechnology [30]. In this study, the use of a filter with exclusion criteria based on the prediction data (In Diagram 1) showed that gudiv_61, gudiv_103, gudiv_517, and gudiv_681 are the ULAMPs most promising for immunobiological applications and for expression in E. coli as a heterologous system. However, the fact that an antigen does not meet all the requirements of Diagram 1 does not rule it out as a target for immunobiological studies or expression in a heterologous system. Depending on the type of analysis, proteins having good immunostimulatory properties, but with properties that hinder expression in E. coli could be expressed in other expression systems [33], or even in E. coli through fusion with proteins (tag) that increase the size of the transcript or improve solubility, reduce growth temperature, use of weak promoters and use of low concentrations of inducer [24]. Very large proteins or with many transmembrane loops could be studied by producing multiepitope chimeric proteins [34]. Finally, there is also the possibility of using expression systems entirely in vitro [35]. However, these alternatives increase the costs of the process; therefore, the inclusion of prediction in the planning stages of works that intend to express proteins can reduce project costs in addition to providing a theoretical forecast of bench tests.

In this work, the PCR detection of 36 UdLAMPs in isolates from U. diversum, from different regions of Brazil, warns of potential damage to livestock that U. diversum can cause, because in addition to immunomodulation, studies suggest that LAMPs are involved in adherence and invasion and cell apoptosis [2, 7, 15, 36]. Strains representing the four evaluated states (Bahia, Minas Gerais, São Paulo, and Mato Grosso do Sul) presented proteins with interesting properties for immunological stimulation (Diagrams 1 and 3). These data corroborate with other studies that show that U. diversum induces variable immune responses in vivo and in vitro [7, 37].


It was demonstrated that the U. diversum genome has CDS for molecules with potential for application in immunodiagnostic or immunoprophylactic tests and expression in E. coli as a heterologous system. PCR screening of antigens on strains from different states revealed that UdLAMPs have a heterogeneous distribution in different regions of Bahia, Minas Gerais, São Paulo, and Mato Grosso do Sul. In this study, 34 of the 36 UdLAMPs studied were noted in the genome as UdLAMPs and that many of them have signaling of typical lipoprotein secretion. It is well described in the literature that Mollicutes have ingenious molecular mechanisms to change parts of these molecules; however, this initial study contributes to understanding the virulence factors of U. diversum and provides a series of data and approaches that can be used in studying these pathogens.


Access to genes and analysis of similarity with bovine proteomes

The CDS and peptide sequences of 36 UdLAMPs, strain ATCC 49782, were accessed through the Manatee database ( The DNA sequences also are available in the GenBank: CP009770). Similarity analyses between proteins of U. diversum and proteomes of bovine subspecies (Bos taurus taurus, Bos taurus indicus and the hybrid Bos taurus x Bos indicus) were performed using the BLASTp tool accessed on the server The proteomes were accessed on the BLASTp platform itself through the UniProtKB/Swiss-prot (swissprot) database under taxonomy IDs 9913 (Bos taurus taurus, protein count: 37513), 9915 (Bos taurus indicus, protein count 1243) and 30,522 (hybrid Bos taurus x Bos indicus, protein count: 42151).

Mapping of B lymphocyte epitopes and antigenicity prediction

The CBTOPE v1.0 server (available at was used to predict discontinuous (conformational) epitopes of B lymphocytes. A threshold of − 0.3 was used, and on the probability scale (0–9) amino acids with values greater than four were considered conformational epitopes. This server has a data set with non-redundant protein chains consisting of antibody interacting residues of B cell epitopes [38]. To predict continuous epitopes, the primary protein sequences were analyzed in the BepiPred v2.0 software (, a predictor trained only with data, present in your internal database, from epitopes derived from crystallographic structures. Amino acids with thresholds greater than 0.5 were considered linear B cell epitopes [13]. The protein sequences were also submitted to the VaxiJen v2.0 server (; this predictor allows classifying antigens without using the sequence alignment feature. All proteins predicted to score above thresholds (0.5) were classified as antigenic. The prediction of B cell epitopes and antigenicity was also performed for the Msp5 ESXA_MYCBO peptide from A. marginale accessed at NCBI under ID number AY527217.1.

Mapping of TCD8+ lymphocyte epitopes and identity analysis with proteomes of other Mollicutes

The prediction of binding to MHCI with peptide windows with 9 amino acids, was performed using the server NetBoLApan v1.0, accessed at A standard threshold of 0.5% was used for strong bonds and 2% for weak bonds; finally, the number of strong and weak connections were added and expressed in absolute numbers. The NetBoLApan v1.0 was trained on a peptide dataset with binding affinity to BoLA molecules [39]. The alleles used in this study were BoLA-6*01301 (HD6), BoLA-2*01201 (T2A), BoLA-3*00201 (JSP), BoLA-1*02301 (D18.4), BoLA-3*00101 (AW10), BoLA-6*04101 (T2B), BoLA-T2C and Bola–T5. In this set of alleles there are representatives of three bovine subspecies (Bos taurus taurus, Bos taurus indicus and the hybrid Bos taurus taurus x Bos taurus indicus), thus including cattle involved in various livestock activities [17]. The same analyses were performed for Theileria parva Tp2 antigen.

The BLASTp was used for identity analysis of 36 UdLAMPs with proteomes of M. bovis (831 protein count and taxonomy IDs: 28903), M. canadense (481 protein count and taxonomy IDs: 29554), M. bovigenitalium (677protein count and IDs taxonomy: 1188235), M. bovirhinis (720 protein count and taxonomy IDs: 29553), and M. dispar (712 protein count and taxonomy IDs: 86660).

Prediction of secretion and subcellular localization

Prediction of classical secretion and identification of SP were performed on the SignalP v5.0 server available at and DOLOP, a server that uses SP characteristics to predict lipoproteins - The SecretomeP v2.0 web server ( was used to predict non-classical secretion. Predicted values equal to or greater than 0.5 (threshold) were considered indicative of secretion. Protein sequences were also subjected to the prediction of subcellular location in the PSORTb v3.0.2 software ( using suggested settings for Mycoplasma spp. TMHMM v2.0 ( was used to verify the presence of transmembrane loops.

Investigation of physical-chemical parameters

The physicochemical properties of UdLAMPs including aliphatic index, PM, GRAVY, and instability index were obtained in ProtParam using the ExPASy server at The solubility of heterologous peptides after E. coli overexpression was predicted by the server SOLpro ( and Protein-Sol, accessed at

Filter properties related to immunomodulation and expression in a heterologous system

U. diversum antigens were classified according to two parameters: 1) Undesirable parameters for use in prophylactic and immunodiagnostic measures; in which prediction results for similarity with bovine proteomes were evaluated, number of conformational and continuous epitope regions for B lymphocytes, antigenicity, number of T lymphocyte epitopes (BoLA allele ligands) and identity with other Mollicutes that infect cattle; 2) undesirable parameters for expression in E. coli; in which predicted parameters related to the absence of signaling for excretion by classical or non-classical pathways, protein size, stability index, GRAVY, solubility and presence of transmembrane loops were evaluated.

Obtaining, cultivating and extracting DNA from U. diversum

U. diversum ATCC 49782 and 45 isolates were provided by the Mycoplasma laboratory of the Institute of Biomedical Sciences - University of São Paulo (USP). Some strains were isolated from cows that had granulomatous vulvovaginitis, and others were isolated from the semen of healthy bulls. The isolates were obtained from four states: 19 isolated in São Paulo (farms 1, 2, 4, 8 and 9), 2 isolated in Mato Grosso do Sul (farm 3), 1 in Minas Gerais (farm 6), and 22 in Bahia (farms 10, 11, 12, 13). One milliliter of each sample previously-stored in UB medium was grown in 9 ml of the same medium at 37 °C for 24 to 48 h [7]. After growth, bacterial DNA was extracted using the NucleoSpin kit (Macherey-Nagel, Germany) following the manufacturer’s instructions. After growth, bacterial DNA was extracted using the NucleoSpin kit (Macherey-Nagel, Germany) following the manufacturer’s instructions.

Primer construction, PCR, and electrophoresis

The genomic sequences coding for the 36 antigens of U. diversum were used to design the primers by the servers and Important criteria for the efficiency of primers such as size (18 to 22 bp), melting temperature (52 to 58 °C), and G + C content (40 to 60%) were taken into account. After selecting the best pairs of primers, similarity analysis was performed by the BLASTn server ( to confirm the specificity of the primer sequence for U. diversum. The 46 strains were evaluated for the presence of genes for UdLAMPs using PCR. The amplifications were performed with a total volume of 25 μl containing: 1 μl of DNA, 10x PCR buffer (10 mM Tris – HCl, pH 9.0; 50 mM KCl), 1.5 mM MgCl2; 200 μM dNTP, 50 pmol of each primer and 1.5 U of Taq DNA polymerase (Invitrogen®, Brazil). All genes followed the initial denaturation of 94 °C for 5 min, followed by 35 thermal cycles of 94 °C for 30 s, 54 °C for 30 s, and 72 °C for one minute, concluding with a final extension 72 °C for 5 min. The reaction products were analyzed by electrophoresis on 1.5% agarose gel, stained with 2.5 μl ethidium bromide (10 mg / ml), visualized and photographed under UV light. A molecular weight marker (Invitrogen®, Brazil) - was used as a standard to assess the size of the amplified fragments.

Availability of data and materials

The DNA sequences generated and/or analyzed during the current study are available in the GenBank repository, Accession: CP009770. The others datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.



Amino acids


American Type Culture Collection


Bovine lymphocyte antigen


Gene coding sequences



G +C:

Guanine-cytosine content


Grand average of hydropathy


Immunoglobulin G




Lipid-associated membrane protein


Multiple band antigen


Major histocompatibility complex class I


Mycoplasma Ig binding protein


Mycoplasma Ig protease


Surface protein 5 from A. marginale


Polymerase Chain Reaction


Molecular weight


Signal peptide


T. parva 2 antigen


Ureaplasma base


Lipid-associated membrane protein of U. diversum


  1. 1.

    Marques LM, Buzinhani M, Oliveira RC, Yamaguti M, Ferreira JB, Neto RL, et al. Prevalence of mycoplasmas in the respiratory tracts of calves in Brazil. Vet Rec. 2007;161(20):699–700.

    CAS  Article  Google Scholar 

  2. 2.

    Marques LM, Rezende IS, Barbosa MS, Guimarães AM, Martins HB, Campos GB, et al. Ureaplasma diversum genome provides new insights about the interaction of the surface molecules of this bacterium with the host. PLoS One. 2016;11(9):e0161926.

    Article  Google Scholar 

  3. 3.

    Cardoso MV, Scarcelli E, Grasso LM, Teixeira SR, Genovez ME. Ureaplasma diversum and reproductive disorder in Brazilian cows and heifers; first report. Anim Reprod Sci. 2000;63(3–4):137–43.

    CAS  Article  Google Scholar 

  4. 4.

    Gourlay RN, Howard CJ, Brownlie J. The production of mastitis in cows by the intramammary inoculation of T-mycoplasmas. J Hyg (Lond). 1972;70(3):511–21.

    CAS  Article  Google Scholar 

  5. 5.

    Marques LM, Buzinhani M, Guimaraes AM, Marques RC, Farias ST, Neto RL, et al. Intraspecific sequence variation in 16S rRNA gene of Ureaplasma diversum isolates. Vet Microbiol. 2011;152(1–2):205–11.

    CAS  Article  Google Scholar 

  6. 6.

    Crane MB, Hughes CA. Can Ureaplasma diversum be transmitted from donor to recipient through the embryo? Two case reports outlining U. diversum losses in bovine embryo pregnancies. Can Vet J. 2018;59(1):43.

    PubMed  PubMed Central  Google Scholar 

  7. 7.

    Santos-Junior MN, Rezende IS, Souza CLS, Barbosa MS, Campos GB, Brito LF, et al. and Its Ureaplasma diversum and Its Membrane-Associated Lipoproteins Activate Inflammatory Genes Through the NF-κB Pathway via Toll-Like Receptor 4. Front Microbiol. 2018;9:1538.

    Article  Google Scholar 

  8. 8.

    Marques LM, Guimarães AM, Martins HB, Rezende IS, Barbosa MS, Campos GB, et al. Genome sequence of Ureaplasma diversum strain ATCC 49782. Genome Announc. 2015;3(2):e00314–5.

    Article  Google Scholar 

  9. 9.

    Zhang C, Marcia M, Langer JD, Peng G, Michel H. Role of the N-terminal signal peptide in the membrane insertion of a quifex aeolicus F1F0 ATP synthase c-subunit. FEBS J. 2013;280(14):3425–35.

    CAS  Article  Google Scholar 

  10. 10.

    da Rocha Sobrinho HM, Jarach R, da Silva NA, Shio MT, Jancar S, Timenetsky J, et al. Mycoplasmal lipid-associated membrane proteins and mycoplasma arthritidis mitogen recognition by serum antibodies from patients with rheumatoid arthritis. Rheumatol Int. 2011;31(7):951–7.

    Article  Google Scholar 

  11. 11.

    Hara Y, Mohamed R, Nathan S. Immunogenic Burkholderia pseudomallei outer membrane proteins as potential candidate vaccine targets. PLoS One. 2009;4(8):e6496.

    Article  Google Scholar 

  12. 12.

    Gutlapalli VR, Sykam A, Nayarisseri A, Suneetha S, Suneetha LM. Insights from the predicted epitope similarity between mycobacterium tuberculosis virulent factors and its human homologs. Bioinformation. 2015;11(12):517–24.

    Article  Google Scholar 

  13. 13.

    Jespersen MC, Peters B, Nielsen M, Marcatili P. BepiPred-2.0: improving sequence-based B-cell epitope prediction using conformational epitopes. Nucleic Acids Res. 2017;45(W1):W24–W9.

    CAS  Article  Google Scholar 

  14. 14.

    Corona B, Machado H, Rodríguez M, Martínez S. Characterization of recombinant MSP5 Anaplasma marginale Havana isolate. Braz J Microbiol. 2009;40(4):972–9.

    CAS  Article  Google Scholar 

  15. 15.

    Buzinhani M, Yamaguti M, Oliveira RC, Cortez BA, Marques LM, Machado-Santelli GM, et al. Invasion of Ureaplasma diversum in bovine spermatozoids. BMC Res Notes. 2011;4:455.

    Article  Google Scholar 

  16. 16.

    Pelle R, Graham SP, Njahira MN, Osaso J, Saya RM, Odongo DO, Toye PG, Spooner PR, Musoke AJ, Mwangi DM, Taracha EL, Morrison WI, Weir W, Silva JC, Bishop RP. Two Theileria parva CD8 T cell antigen genes are more variable in buffalo than cattle parasites, but differ in pattern of sequence diversity. PLoS One. 2011;6(4):e19015.

  17. 17.

    Hansen AM, Rasmussen M, Svitek N, Harndahl M, Golde WT, Barlow J, et al. Characterization of binding specificities of bovine leucocyte class I molecules: impacts for rational epitope discovery. Immunogenetics. 2014;66(12):705–18.

    CAS  Article  Google Scholar 

  18. 18.

    Teodoro RL, Martinez ML, Silva M, Machado MA, Verneque R. Resistência bovina ao carrapato Boophilus microplus: experiência brasileira. In: Anais do V Simpósio da Sociedade Brasileira de Melhoramento Animal, Pirassununga, SBMA; 2004.

    Google Scholar 

  19. 19.

    Yisehak K, Becker A, Belay D, Bosch G, Hendriks WH, Clauss M, et al. Salivary amino acid concentrations in zebus (Bos indicus) and zebu hybrids (Bos indicus× Bos taurus) fed a tannin-rich diet. Belgian J Zool. 2011;141(2):93–6.

    Google Scholar 

  20. 20.

    Rost B. Twilight zone of protein sequence alignments. Protein Eng. 1999;12(2):85–94.

    CAS  Article  Google Scholar 

  21. 21.

    Khan FA, Faisal M, Chao J, Liu K, Chen X, Zhao G, et al. Immunoproteomic identification of MbovP579, a promising diagnostic biomarker for serological detection of mycoplasma bovis infection. Oncotarget. 2016;7(26):39376.

    Article  Google Scholar 

  22. 22.

    Chang CCH, Song J, Tey BT, Ramanan RN. Bioinformatics approaches for improved recombinant protein production in Escherichia coli: protein solubility prediction. Brief Bioinform. 2014;15(6):953–62.

    CAS  Article  Google Scholar 

  23. 23.

    Adrio J-L, Demain AL. Recombinant organisms for production of industrial products. Bioengineered Bugs. 2010;1(2):116–31.

    Article  Google Scholar 

  24. 24.

    Sørensen HP, Mortensen KK. Advanced genetic strategies for recombinant protein expression in Escherichia coli. J Biotechnol. 2005;115(2):113–28.

    Article  Google Scholar 

  25. 25.

    Gasteiger E, Hoogland C, Gattiker A, Duvaud S, Wilkins MR, Appel RD, et al., Protein identifica-tion and analysis tools on the ExPASy server, in The Proteomics Protocols Handbook. Walker JM Ed., Totowa: Humana Press; 2005.

  26. 26.

    Vandahl BB, Christiansen G, Birkelund S. Preparation of bacterial samples for 2D-PAGE; in The proteomics protocol handbook. JM Walker (ed). Totowa: Humana press Inc.; 2005. p. 19–27.

  27. 27.

    Negahdaripour M, Nezafat N, Hajighahramani N, Soheil Rahmatabadi S, Hossein Morowvat M, Ghasemi Y. In silico study of different signal peptides for secretory production of interleukin-11 in Escherichia coli. Current Proteomics. 2017;14(2):112–21.

    CAS  Article  Google Scholar 

  28. 28.

    Käll L, Krogh A, Sonnhammer EL. A combined transmembrane topology and signal peptide prediction method. J Mol Biol. 2004;338(5):1027–36.

    Article  Google Scholar 

  29. 29.

    Singh SM, Panda AK. Solubilization and refolding of bacterial inclusion body proteins. J Biosci Bioeng. 2005;99(4):303–10.

    CAS  Article  Google Scholar 

  30. 30.

    Chaudhry R, Nisar N, Hora B, Chirasani SR, Malhotra P. Expression and immunological characterization of the carboxy-terminal region of the P1 adhesin protein of mycoplasma pneumoniae. J Clin Microbiol. 2005;43(1):321–5.

    Article  Google Scholar 

  31. 31.

    Rosano GL, Ceccarelli EA. Recombinant protein expression in Escherichia coli: advances and challenges. Front Microbiol. 2014;5:172.

    PubMed  PubMed Central  Google Scholar 

  32. 32.

    Mohammadi S, Mostafavi-Pour Z, Ghasemi Y, Barazesh M, Pour SK, Atapour A, et al. In silico analysis of different signal peptides for the excretory production of recombinant NS3-GP96 fusion protein in Escherichia coli. Int J Pept Res Ther. 2019;25(4):1279–90.

    CAS  Article  Google Scholar 

  33. 33.

    Kaur J, Kumar A, Kaur J. Strategies for optimization of heterologous protein expression in E. coli: Roadblocks and reinforcements. Int J Biol Macromol. 2018;106:803–22.

    CAS  Article  Google Scholar 

  34. 34.

    AnandaRao R, Swaminathan S, Fernando S, Jana AM, Khanna N. A custom-designed recombinant multiepitope protein as a dengue diagnostic reagent. Protein Expr Purif. 2005;41(1):136–47.

    CAS  Article  Google Scholar 

  35. 35.

    Popot JL. Amphipol-Assisted Cell-Free Expression of Membrane Proteins. In: Membrane Proteins in Aqueous Solutions. Biological and Medical Physics, Biomedical Engineering. Cham: Springer; 2018.

  36. 36.

    Hobson N, Chousalkar KK, Chenoweth PJ. Ureaplasma diversum in bull semen in Australia: its detection and potential effects. Aust Vet J. 2013;91(11):469–73.

    CAS  Article  Google Scholar 

  37. 37.

    Gaeti JG, Lana MV, Silva GS, Lerner L, de Campos CG, Haruni F, et al. Ureaplasma diversum as a cause of pustular vulvovaginitis in bovine females in Vale Guapore, Mato Grosso state, Brazil. Trop Anim Health Prod. 2014;46(6):1059–63.

    Article  Google Scholar 

  38. 38.

    Ansari HR, Raghava GP. Identification of conformational B-cell epitopes in an antigen from its primary sequence. Immunome Res. 2010;6(1):6.

    Article  Google Scholar 

  39. 39.

    Nielsen M, Connelley T, Ternette N. Improved prediction of bovine leucocyte antigens (BoLA) presented ligands by use of mass-spectrometry-determined ligand and in vitro binding data. J Proteome Res. 2018;17(1):559–67.

    CAS  Article  Google Scholar 

Download references


We also thank Aricelma P. França for invaluable technical assistance and Jim Hesson of Academic English Solutions ( for revising the English.


This work was supported by grants from the São Paulo Research Foundation (FAPESP) (grant number 2017/25686–3), the National Council for Scientific and Technological Development (CNPq) (grant number 409758/2018–3) and Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) (Finance Code 001). The funders played no role in the design as well as conclusion of the study.

Author information




Conceived and designed the experiments: MNSJ, GBC, BLB, JT and LMM. Immunoinformatics analysis: MNSJ, RSS, WSN, BCBG, MSB, LSCS, CPG, ISR. Molecular analysis: MNSJ, JMF, CNTO, ISR, NSMN. Analyzed the data: MNSJ, GBC, BLB, GBC, JT and LMM. Contributed reagents/materials/analysis tools: GBC, BLB, JT and LMM. Wrote the paper: MNSJ, GBC, JT and LMM. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Lucas Miranda Marques.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1: Table S1.

Sequences of the most significant B lymphocyte epitopes (highest thresholds in the Pepipred 2.0 predictor) accompanied by the position of the respective epitope in each UdLAMP.

Additional file 2: Table S2.

Sequence of the most representative CD8+ T lymphocyte epitopes and respective position in each UdLAMP predicted using NetBoLApan v1.0. Peptides with an 8-amino acid window of the BoLA-1 *02301, BoLA-3 *00201, BoLA-2 *01201 and BoLA-6 *01301 alleles are represented.

Additional file 3: Table S3.

Sequence of the most representative CD8+ T lymphocyte epitopes and respective position in each UdLAMP predicted using NetBoLApan v1.0. Peptides with an 8-amino acid window of the BoLA-3 *00101, BoLA-4 *04101, BoLA-T2C, and BoLA-T5 alleles are represented.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Santos Junior, M.N., Santos, R.S., Neves, W.S. et al. Immunoinformatics and analysis of antigen distribution of Ureaplasma diversum strains isolated from different Brazilian states. BMC Vet Res 16, 379 (2020).

Download citation


  • Ureaplasma diversum
  • Immunoinformatics
  • Lipoproteins
  • Prediction