Published online before print May 22, 2003
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||



* Laboratory for Cellular Differentiation, Department for Stemcell Biology, and
Department of Theoretical Physics, Lund University, Sweden
Correspondence: Mikael Sigvardsson, Laboratory for Cellular Differentiation, Department for Stemcell Biology, BMC B12, Lund, 22184, Sweden. E-mail: Mikael.Sigvardsson{at}stemcell.lu.se
|
|
|---|
Key Words: gene expression immunoglobulin progenitor B cells
|
|
|---|
-subunit of the interleukin (IL)-7 receptor [5
6 ]. Subsequent differentiation results in the expression of the recombination-activating genes Rag-1 and Rag-2 and initiation of Ig recombination events [7
]. This generates a functional Ig heavy-chain (IgH) gene that is transcribed, translated, and displayed on the cell surface in complex with the surrogate light-chain components
5 and VpreB, as well as the signal-transduction molecules Ig
(mb-1) and Igß (B29) [8
]. Subsequent differentiation allows for rearrangements of the Ig light-chain (IgL) genes that replace the surrogate light-chain genes on the surface of the B cell [8
]. This immature cell is then subjected to negative selection to delete self-reactive cells before it leaves the BM to enter peripheral lymphoid organs, where it becomes a mature B cell [9
]. If the cells are activated by interaction with antigens and obtain T cell help, they mature into terminally differentiated plasma cells secreting large amounts of antibodies [10
11
12
]. The extensively studied biology of B cell development, in combination with the defined stages of differentiation, makes it a useful system for investigations of complex molecular events that might provide clues to general features of cellular development.
To obtain a deeper understanding of the molecular processes involved in B cell development and to create a map over stage-restricted gene expression, we wanted to establish a model system that allowed for a reasonable approximation of the gene expression profile during B lymphoid differentiation. Features such as varying proliferation status, heterogeneous populations, and difficulties to obtain sufficient amounts of material limit the use of primary sorted cells. The existence of numerous B cell lines arrested at defined differentiation stages would then pose a possibility to overcome some of these problems. They also provide a highly reproducible source of material that allows for the performance of large-scale gene expression analysis without the use of intermediate amplification steps. The use of cell lines does, however, introduce a risk of obtaining cell line-specific features as a result of the transformation process. To reduce the risk of analyzing cell line-specific features, we used several representative cell lines for each of four major stages in B cell development: pro-B, pre-B, B, and plasma cells, and investigated the gene expression pattern in these cell lines by AffymetrixTM microarrays containing
12,000 gene tags. This allowed for the correct classification of a large number of control genes using dCHIP-based, relative expression level analysis [13
] or presence/absence (P/A)-based, probabilistic state analysis (S. Bilke et al., submitted). We also identified a large number of additional genes that now can be considered as candidates to display stage-restricted expression patterns during B cell development.
|
|
|---|
Gene expression analysis
RNA was prepared using Trizol (Gibco, Grand Island, NY), and 7.5 µg of total RNA was annealed to a T7-oligo T primer by denaturation at 70°C for 10 min followed by 10 min of incubation of the samples on ice. First-strand synthesis was performed for 2 h at 42°C using 20 U Superscript reverse transcriptase (RT; Gibco) in buffers and nucleotide mixes according to the manufacturers instructions. This was followed by a second-strand synthesis for 2 h at 16°C, using RNaseH, Escherichia coli DNA polymerase I, and E. coli DNA ligase (all from Gibco), according to the manufacturers instructions. The obtained, double-stranded cDNA was then blunted by the addition of 20 U T4 DNA polymerase and incubated for 5 min at 16°C. The material was then purified by phenol:cloroform:isoamyl alcohol extraction followed by precipitation with NH4Ac and ethanol. The cDNA was then used in an in vitro transcription reaction for 6 h at 37°C using a T7 IVT kit and biotin-labeled ribonucleotides. The obtained cRNA was purified from unincorporated nucleotides on an RNAeasy column (Qiagen, Valencia, CA). The eluted cRNA was then fragmented by incubation of the products for 2 h in fragmentation buffer (40 mM Tris-acetate, pH 8.1, 100 mM KOAc, 150 mM MgOAc). The final, fragmented cRNA (20 µg) was hybridized to AffymetrixTM chip U74Av2 (Affymetrix, Santa Clara, CA) in 200 µl hybridization buffer [100 mM 2-(N-morpholino)-ethanesulfonic buffer, pH 6.6, 1 M NaCl, 20 mM EDTA, 0.01% Tween 20], supplemented with Herring sperm DNA (100 µg/ml) and acetylated bovine serum albumin (500 µg/ml) in an Affymetrix Gene Chip Hybridization Oven 320. The chip was then developed by the addition of fluorescein isothiocyanate (FITC)-streptavidin followed by washing using an Affymetrix Gene Chip Fluidics Station 400. Scanning was performed using a Hewlett Packard Gene Array scanner.
Data analysis
Probabilistic estimation of gene expression pattern was performed using the Breslin/Bilke method (S. Bilke et al., submitted). Hierarchical tree clusters were generated using the dCHIP program (Li and Wong [13], <http://biosun1.harvard.edu/complab/dchip/>). The initial analysis (see Fig. 2
) was performed using the perfect match (PM)-only model, and genes were filtered according to 0.50 <standard deviation/mean between <10.00, and P call in the array used ≥20%. The pro-B, pre-B, B, and plasma cell lines (see Fig. 4
and Supplemental Figs. 1
2
3
4
available at http://www.jleukbio.org/) were treated as replicates using the same model as above but a P call above 10%. Classification of genes with an apparent restricted expression pattern in the P/A analysis into functional groups was performed manually using the National Center for Biotechnology Information (NCBI; National Institutes of Health, Bethesda, MD) database (see supplementary tables). Nondefined genes in the data set were resubmitted in a Blast search into Genebank allowing for the identification of most of these entries.
![]() View larger version (50K): [in a new window] |
Figure 2. dCHIP analysis of gene expression data suggests that B cell lines belonging to a certain differentiation stage generally display similar expression patterns of stage-specific genes. The figure displays a dCHIP analysis of the RNA expression patterns in the different cell lines selected for our investigations after filtering and hierarchical clustering. The name of the cell line and the earlier classification are displayed above the data panel. The criteria selected for the definition of differentially expressed genes were a 20% present count and a maximum standard deviation of 0.5 (0.50<standard deviation/mean between <10.00). Expression scales ranging from -3 (blue) to +3 (red) are indicated below the data display.
|
|
View larger version (16K): [in a new window] |
Figure 4. Treatment of the data from different cell lines representing the same differentiation stage as replicates allows for the identification of stage-specific genes. The figure displays a dCHIP-generated cluster analysis of differential gene expression in the groups of cell lines after filtering and hierarchical clustering. The identification of control genes within the groups is indicated to the right of the color scheme. The full analysis with gene names of differentially expressed genes can be found as Attachment Figures 1
2
3
4
. Expression scales ranging from -3 (blue) to +3 (red) are indicated below the data display. The figure only displays genes classified as present on more than one chip and to 0.50 < standard deviation/mean between <10.00. TdT, Terminal deoxynucleotidyl transferase; IgHV, Ig heavy-chain variable region; MCH, major histocompatibility complex.
|
![]() View larger version (84K): [in a new window] |
Figure 1. Cell lines arrested in development express stage-specific genes in patterns resembling primary-sorted B lineage cells. (A) Ethidium bromide-stained agarose gels with PCR products obtained by RT-PCR analysis of primary-sorted cells as indicated. The cDNA was diluted in three steps to allow for a degree of quantification of the expressed transcripts. (B) A gel with a RT-PCR experiment displaying the expression of the same genes in a panel of cell lines as indicated.
|
![]() View larger version (28K): [in a new window] |
Figure 3. Investigations of expression pattern of individual control genes indicate that RNA expression data from a group of cell lines can be used for the delineation of stage-specific genes. The figure displays diagrams over normalized expression levels of genes associated with B cell development as indicated on top of each display. The 5 gene is represented by two probe sets. The standard deviations have been calculated by treatment of the cell lines belonging to a defined stage as replicates. Id-1, Inhibitor of DNA binding-1; EBF1, early B cell factor; BCL-6, B cell lymphoma-6; BLIMP, B lymphocyte-induced maturation protein-1.
|
For all PCRs, the program was the same except for the number of cycles (Y) and the annealing temperature (XX). The common parts of the program were (Y cycles) 95°C for 2 min, 95°C for 45 s, XX°C for 45 s, 72°C for 1 min, and 72°C for 2 min. Annealing temperaturescycles were: actin, 57ºC, 25 cycles; hypoxanthine guanine phosphoribosyl transferase (HPRT), 52ºC, 25 cycles;
5, 60ºC, 30 cycles; Pax-5, 60ºC, 30 cycles; J-chain, 60ºC, 28 cycles; Bach1, 55°C, 30 cycles; RhoB, 56°C, 30 cycles; Yes, 55°C, 30 cycles; Sel1, 55°C, 25 cycles; Arhgef, 55°C, 27 cycles; protein kinase C (PKC)-ß, 55°C, 28 cycles; and Pftaire-1, 55°C, 30 cycles.
Oligonucleotides used for RT-PCR were: actin, sense 5'GTTTGAGACCTTCAACACC, antisense 5'GTGGCCATCTCCTGCTCGAAGTC; B29, sense 5'GGTGAGCCGGTACCAGCAATG, antisense 5'AGTTCCGTGCCACAGCTGTCG;
5, sense 5'TGTGAAGTTCTCCTCCTGCTC, antisense 5'ACCACCAAAGTACCTGGGTAG; Pax-5, sense 5'CTACAGGCTCCGTGACGCAG, antisense 5'GTCTCGGCCTGTGAAATAGG; Bach-1, sense 5'ACTCTCAGTTCCGTCAACTGC, antisense 5'TTCCTCTTGCGACAGCGTTGC; Arhgef3 (EST-1), sense 5'AAACATCCGTCCACTCTCTTCC, antisense 5'TACTGTACACATGGGTCATGTGC; Pftaire-1, sense 5'TGCTCTAGCATACATTGAACC, antisense 5'CTCCCCACTTAAAGAACTCC; PKC-ß-II, sense 5'ATCCACCAGTCCTAACACC, antisense 5'AAGCAAGCATTTTCTCTCC; RhoB, sense 5'CTGATCGTGTTCAGTAAAGACGAATTCC, antisense 5'TTGTTGGCCACCAGGATGATGG; Yes-associated protein, sense 5'GCAGTTACAGATGGAGAAGGAG, antisense 5'TTGCATCTCCTTCCAGTGTGC; HPRT, sense 5'GCTGGTGAAAAGGACCTCT, antisense 5'CACAGGACTAGAACACCTGC; J-chain, sense 5'GTAGGTGGTACCTATACAATAACA, antisense 5'AGGGTAGCAAGAATCGGGGGTCAA.
Isolation and purification of BM progenitors and mature peripheral B cells
BM cells were sorted on a FACSVantage Cell Sorter (Becton Dickinson, San Jose, CA), equipped with a 488-nm argon ion (Coherent Enterprise II, Santa Clara, CA) and a 633-nm He-Ne (Model 127, Spectra-Physics, Mountain View, CA) laser. Antibodies used were B220 antigen-presenting cell, CD43 phycoerythrin (PE), IgM biotin (Streptavidin TRI), CD19 FITC, and CD138 (Syndecan1) PE (all from PharMingen, San Diego, CA). The purity of all sorted cell populations is reproducible over 95%. To obtain activated and mature B cells, magnetic cell sorter-purified B220+ spleen cells were incubated in 50 ng/ml lipopolysaccharide (LPS; Sigma Chemical Co., St. Louis, MO) at 37°C for 72 h.
|
|
|---|
5 [18
] was expressed mainly in pre-B cells. Some
5 expression was also detected in the Syndecan-1+ B220- BM cells representing primary plasma cells. This probably reflects contamination with pre-B cells rather than true expression of this gene at this late developmental stage, as no
5 expression could be detected in the LPS-stimulated splenocytes. The expression of the transcription factor Pax-5 [19
] was high in the pre-B and B cells, and it appeared to be lower in the plasma-cell populations. In contrast, the Ig-associated J-chain was expressed at a higher level in the plasma cells than in the other populations [20
]. Analysis of the expression pattern of the same genes in a selection of cell lines (Fig. 1B) indicated that all these expressed the B lineage-restricted B29 (Igß) gene, and only the pre-B cell lines expressed
5 message. Pax-5 was not expressed in the pro-B cell lines or in the plasma cell lines, and the pre-B and B cell lines expressed this transcription factor. Message encoding the J-chain was present in one of the pre-B cell lines (230238) and in cell lines representing the mature or the plasma cell stage. This suggests that the cell lines display expression of B cell markers in patterns comparable with those observed in primary sorted cells.
To expand the analysis of the gene expression patterns in the different B lineage cell lines, we analyzed RNA from these on AffymetrixTM gene chip microarrays containing
12,000 sequence tags. The data were then analyzed using the dCHIP program [13
], allowing for the identification of differentially expressed genes in the cell line samples (Fig. 2
). This analysis indicated that although there were differences in expression levels, the cell lines previously defined as belonging to a specific stage of development generally displayed a similar cluster of differentially expressed genes. The exception was the B cell line WEHI231, which appeared to group with the pre-B cells rather than with the B cells, possibly reflecting that this cell represents an immature BM-derived B cell rather than a mature peripheral cell [21
22
]. The homogeneity in expression levels within the cell line groups representing the different stages was also analyzed by the extraction of normalized expression levels for a set of genes linked to B cell development [1
3
4
5
6
] (Fig. 3
). The expression of the transcription factor Id-1 was reduced upon progression into the pre-B cell stage, and the levels of the signal transducer B29 was increased and maintained in all the cell line groups. mRNA encoding
5, V-preB1, VpreB-3, and the IL-7 receptor
subunit were all transiently up-regulated in the pre-B cell lines. Mb-1, EBF, and CD19 expression was high in the pre-B cells, but the mRNA could also be detected in the B cell lines. CD24a [human serum albumin (HSA)] was expressed in the pre-B and mature B cells as was PKC-ß. The germinal center-specific transcription factor BCL-6 [23
24
] was expressed specifically in the B cell lines, and the Ig-associated J-chain was expressed in the B and the plasma cells. Only the latter cells expressed the plasma-cell transcription factor BLIMP [25
]. Thus, although the standard deviations in some cases were substantial, these data indicate that the cell lines representing a specific developmental stage present rather homogenous gene expression patterns and support the idea that stage-specific genes can be identified using a series of B cell lines. To reduce the impact of cell line-specific features, we then treated all cell lines belonging to a certain differentiation stage as replicates and performed another dCHIP analysis. This resulted in the identification of a large number of genes, including several genes with previously defined expression patterns, as expressed in a stage-specific manner (Fig. 4
and AttachmentFigs. 1
2
3
4
).
AffymetrixTM P/A analysis allows for the characterization of stage-restricted gene expression
The design of the AffymetrixTM microarrays, with one set of matching and one set of mismatching oligonucleotides for each gene, allows for a comparison of the obtained signals from the two probe sets. The data are then evaluated by AffymetrixTM array analysis software, allowing for the classification of all the studied genes as present (P) or absent (A) in each of the samples. This transforms the data set into binary values creating novel possibilities for mathematical analysis of the obtained data. This does, however, delete all information about relative expression levels; so to investigate if binary values could be used for the identification of stage-specific genes, we constructed a Hamming distance matrix based on P/A analysis. This generates a measure of similarity between any two samples based on the number of genes for which the AffymetrixTM P/A calls differ (Fig. 5
). The pro-B and plasma cell groups were the most homogenous, and the B cell group displayed a poorer similarity. M12, K46, and A20 appeared similar, while the WEHI231 cells rather resembled the pre-B cells. This indicates that P/A analysis allows for the correct stage classification of the cell lines and that this analysis method could be used to obtain information about stage-specific gene expression.
![]() View larger version (81K): [in a new window] |
Figure 5. P/A analysis allows for stage determination of B cell lines. The figure shows a distance matrix based on P/A analysis of the cell lines used for the generation of the data set. Ba/F3 1, 2, and 3 are differentially obtained subclones of the pro-B cell line Ba/F3, and Ly9D is an independently generated pro-B cell clone. 230238, 40E1, and 1881 are pre-B cell lines generated by Abelson virus transformation. 70/Z3 is a nitrosurea-induced pre-B cell line. WEHI231, A20, K46, and M12 are all defined as B cell lines, and S194, J558, SP2.0, and MPCII represent plasma cells. The scale ranges from highest similarity (black) to lowest (white).
|
|
View this table: [in a new window] |
Table 1. dCHIP and Probabilistic State Analysis Can Be Used As Complementary Methods for Gene Expression Analysis
|
![]() View larger version (62K): [in a new window] |
Figure 6. Calculated gene expression patterns can be verified by RT-PCR analysis. The figure displays agarose gels, with the PCR products obtained using primers amplifying genes predicted to display restricted expression patterns (Attachment Figs. 1
2
3
4
and supplementary tables). The identities of the amplified mRNAs are indicated to the right, and the cell line used to generate cDNA is indicated on top of the panel. The PCR product has been visualized by ethidium bromide staining.
|
|
|
|---|
Expression analysis of pre-B cell lines suggests simultaneous stage-restricted expression of several nonrearranged IgH genes
Another aspect of B cell development that does not become apparent when using sorted primary cell populations for gene expression analysis is reflected in the detection of RNA encoded by several V-region, heavy-chain (VH) genes, including pseudo-genes, specifically in the pre-B cell lines (Table 1)
. As the cell lines are of clonal origin, this could indicate that one and the same pre-B cell has the ability to express several VH genes simultaneously. These transcripts were in most cases not detected at the later stages of differentiation, and the mature cells, to a larger extent, expressed IgVL (V-region, light chain) genes. This is likely to reflect an ongoing rearrangement process of the heavy-chain gene in pre-B cell lines [40
], with sterile expression of VH genes making them accessible for the recombination machinery [7
41
]. The expression of these V genes appears to be silenced at the later stages of development, possibly to ensure that no additional rearrangements of the heavy-chain genes occur during the assembly of the light-chain genes [8
]. It may also be a mechanism contributing to allelic exclusion of the heavy chain to ensure that each single B cell only expresses one type of surface-bound Ig to avoid cross-reactive immune responses [8
]. Thus, there may be a biological necessity in this rather complicated expression pattern, possibly demanding differential regulation of IgH promoters during B cell development. This type of information would not be extracted from the use of primary, sorted cells, as a broad expression of Ig genes could be explained by the heterogeneity of the sorted cell populations.
The AffymetrixTM P/A analysis is useful for the identification of stage-specific genes
Although the general picture of gene expression patterns was the same, independently of which method we used for the analysis of our data, the results from the (dCHIP) analysis differed to some extent from that obtained by P/A-based, probabilistic state analysis. As the dCHIP analysis takes into regard the relative transcription levels, there will be one group of genes that is expressed at all developmental stages but at different relative levels, which will be detected using dCHIP but not P/A analysis. These genes will be classified as present in all the groups and therefore not be detected as stage-specifically regulated in a P/A analysis. A bit more surprising was the detection of genes by the P/A-based method that we could not get classified in the dCHIP analysis. This is probably a result of that fact that rather small changes in relative expression values could change the classification from absent to present. Such an alteration might be classified as insignificant in the dCHIP analysis. This means that the P/A method gives a higher sensitivity of the data analysis, but at the same time, it will also increase the probability of detecting nonregulated genes. However, it appears that we in some cases detect different control genes using different analysis methods, indicating that the two approaches are complementary to each other.
We have not performed any extended analysis of the data we obtained as a result of the large amounts of information and the validity of the expression profile, as any individual gene needs further investigation. The analysis does, however, provide information that can be used to create a preliminary map of gene expression patterns that can be used to formulate working hypotheses for complex molecular events in B cell development.
Received January 9, 2003; revised March 6, 2003; accepted March 24, 2003.
|
|
|---|
enhancer function Genes Dev 5,880-894This article has been cited by other articles:
![]() |
K. Anderson, C. Rusterholz, R. Mansson, C. T. Jensen, K. Bacos, S. Zandi, Y. Sasaki, C. Nerlov, M. Sigvardsson, and S. E. W. Jacobsen Ectopic expression of PAX5 promotes maintenance of biphenotypic myeloid progenitors coexpressing myeloid and B-cell lineage-associated genes Blood, May 1, 2007; 109(9): 3697 - 3705. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Jacquelin, T. Kortulewski, P. Vaigot, A. Pawlik, G. Gruel, O. Alibert, P. Soularue, C. Joubert, X. Gidrol, and D. T.-L. Roux Novel pathway for megakaryocyte production after in vivo conditional eradication of integrin {alpha}IIb-expressing cells Blood, September 15, 2005; 106(6): 1965 - 1974. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Mansson, P. Tsapogas, M. Akerlund, A. Lagergren, R. Gisler, and M. Sigvardsson Pearson Correlation Analysis of Microarray Data Allows for the Identification of Genetic Targets for Early B-cell Factor J. Biol. Chem., April 23, 2004; 279(17): 17905 - 17913. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||