Published online before print June 12, 2006
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
,
,1
,
,
,1
,
,
,
,
,¶



,
,2
* Haematology and Oncology Program, Child Health Research Institute, North Adelaide, South Australia;
The Queen Elizabeth Hospital, Woodville, South Australia; Departments of
Paediatrics and
** Medicine and
School of Mathematical Sciences, University of Adelaide, South Australia;
¶ The Division of Human Immunology and Hanson Institute, Institute of Medical and Veterinary Sciences, Adelaide, South Australia;
|| The Genetics and Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Melbourne, Victoria, Australia; and

Cancer Biology Program, Centre for Immunology and Cancer Research, Princess Alexandra Hospital, Woolloongabba, Queensland, Australia
2Correspondence: Child Health Research Institute, 7th Floor, Clarence Rieger Building, 72 King William Road, North Adelaide, South Australia 5006, Australia. E-mail: Richard.dandrea{at}adelaide.edu.au
|
|
|---|
(C/EBP
) target genes and generates changes characteristic of a specific acute myeloid leukemia signature, defined previously by gene-expression profiling and associated with C/EBP
mutations.
Key Words: myeloid transcription factor myeloid leukemia microarray gene expression
|
|
|---|
Given the pivotal roles of HGF signaling in regulating hematopoiesis, it is not surprising that aberrant HGF receptor activation or activation of pathways downstream of HGF receptors is associated with leukemia. A key example of this is activation of fms-related tyrosine kinase 3 (FLT3), which represents the most commonly mutated gene in acute myeloid leukemia (AML) and is constitutively activated by acquired mutation (most commonly, internal tandem duplications) in 3035% of AML cases [8
, 9
]. Aberrant HGF signaling in AML cells can also result from constitutive activation of other receptors [10
11
12
] or signaling molecules [13
14
15
]. Autocrine production of GM-CSF and IL-3 also occur occasionally in AML [16
] and chronic myeloid leukemia [17
] and also result in constitutive activation of proliferation and survival pathways. The current model for pathogenesis of AML involves cooperation between these mutations and others, which leads to a block in differentiation or acquisition of self-renewal. This is associated with disruption of the normal transcriptional program, sometimes as a result of lesions involving key myeloid TFs such as C/EBP
and PU.1 [1
] but commonly through the interference of translocation-derived fusion proteins [18
]. There are overlaps between the effects of these two cooperating classes of mutation, as for example, FLT3 activation appears to contribute to a differentiation block in some contexts [19
].
With the advent of new, gene-profiling technologies, it has become possible to investigate the molecular basis of myeloid leukemias using approaches that measure global gene-expression changes. Over recent years, there has been a large mass of data generated with respect to the gene-expression profiles of AML patient samples [20
21
22
23
], sets of genes downstream of leukemogenic TFs [24
, 25
] and associated with FLT3 mutations [19
, 26
27
28
]. Such profiling needs to be considered together with the gene-expression patterns available for myeloid cell line models undergoing directed differentiation [29
30
31
32
] and granulocytic populations at different stages [33
]. In many of these analyses, gene-expression changes cannot generally be associated, a priori, with a specific cellular process (e.g., mitogenesis, promoting, or blocking differentiation, survival, self-renewal), as in these systems, many processes are occurring simultaneously. Dissection of gene-expression changes associated with each cellular outcome requires differential activation of these processes in parallel cell systems, such that gene-expression profiles can be monitored simultaneously. With such a system, a linear modeling approach permits identification of different classes of genes based on such a complex set of conditions. With this in mind, we turned to a bipotential cell line model of myeloid differentiation, which displays differential responses to GM-CSF, IL-3, and activated GM-CSF receptor mutants. The FDB1 cell line is strictly growth factor-dependent for survival; however, cells proliferate continuously in the presence of IL-3 with only a minimal amount of spontaneous differentiation to neutrophils, monocytes, and megakaryocytes. In the presence of GM-CSF, FDB1 cells differentiate synchronously along the neutrophil and monocyte lineages with complete differentiation after 57 days [34
]. This ability to uncouple mitogenesis and self-renewal from differentiation while still using physiological stimuli allows a dissection of these processes, not readily achieved in primary cells, which undergo simultaneous proliferation and differentiation and eventual growth arrest and cell death. To increase the power of this study, we included two activated mutants of the GM-CSF receptor, which have differential leukemogenic activity. These two mutants, FI
and V449E, are derived from the common signaling subunit (hßc) for GM-CSF, IL-3, and IL-5 (reviewed in Gonda and DAndrea [35
] and DAndrea and Gonda [36
]) and represent two distinct classes of activated receptor [extracellular (EC); transmembrane (TM). The two mutants display overlapping, biochemical responses compared with the GM-CSF and IL-3 receptor complexes [37
] and most likely, represent alternative receptor configurations [35
]. In vivo, V449E induces a myeloid leukemia consistent with its ability to support the generation of immature myeloid cell lines in vitro [38
], and the EC mutant FI
induces cytokine-independent formation of CFU-GM [colony-forming units-GM] and erythroid progenitor colonies and leads to a myeloproliferative disease in murine models [38
, 39
]. It is important that the FDB1 cell line also responds differentially to these activated GM-CSF receptor mutants. V449E induces factor-independent proliferation of FDB1 cells and is able to block GM-CSF-induced differentiation, properties that mimic the ability of this mutant to induce AML in vivo. FI
induces factor-independent GM differentiation, reflective of the signals that give rise to myeloproliferative disorder in vivo [34
, 40
]. Thus, FDB1 provides a manipulable model with which to study downstream signaling and transcriptional responses from cytokines and activated receptors. Here, we have used this model, time-course gene-expression profiling, and linear modeling to dissect the molecular mechanisms underlying the differential activities associated with the myeloid growth factors GM-CSF and IL-3 and the activated mutants of the GM-CSF receptor. With a view to understanding the molecular control of myeloid differentiation, we focused our downstream and clustering analysis on identification of candidate regulatory genes encoding TFs and TF-associated products. In addition, we have used comparative analysis between our data and other studies focused on C/EBP
and AML to examine further the mechanism of leukemic receptor signaling. Here, we provide evidence for a mechanism of leukemic induction by activated growth factor receptor mutants, which may involve modulation of C/EBP
activity.
|
|
|---|
Culture and analysis of FDB1 cells
FDB1 cells were maintained as described previously [34
]. Infected pools were maintained as described plus 1 µg/mL Puromycin (Sigma Chemical Co., St. Louis, MO). Receptor expression (FI
or V449E) was confirmed by staining with a murine anti-FLAGTM antibody (Sigma Chemical Co.) followed by an anti-mouse fluorescein isothiocyanate-conjugated antibody (Silenus, Chemicon International, Temecula, CA). Cells were analyzed by flow cytometry using an Epics Elite ESP (Coulter Electronics, UK). If necessary, cells were sorted for expression as described previously [41
]. For the time courses, cells were washed three times in Iscoves modified Dulbeccos medium and cultured in 500 BM U/mL mIL-3 or mGM-CSF or without growth factor {1 BM unit is the amount that gives 50% maximal colony formation (CFU-GM) using bone marrow cells; see ref. [42
]}. To assess differentiation, cells were centrifuged onto slides and stained with May-Grünwald-Giemsa, and the proportion of differentiated cells was determined microscopically. To assess cell viability, the percentage of cells excluding trypan blue was determined using a hemocytometer.
Experimental design
The experiment is arranged as a factorial design studying cell lines and time. For time-course information, each cell population was sampled at six time-points (0, 6, 12, 24, 48, and 72 h), and the time zero (T0) point was used as a reference to obtain relative expression levels following IL-3 withdrawal. We also included the parental FDB1 cell line maintained in IL-3 to establish any baseline differences at T0 as a result of expression of the activated receptors. Matched time comparisons were also performed at all six time-points between the two conditions supporting differentiation (GM-CSF and FI
) and between the two cell populations expressing activated receptor mutants (FI
vs. V449E). Two biological replicate samples were used for each cell population, and a dye-swap replicate was performed for each comparison.
Hybridization and data analysis
Total cellular RNA was harvested with Trizol (Invitrogen, Carlsbad, CA) and further purified with the RNeasy RNA purification kit (Qiagen, Valencia, CA). Aliquots of RNA were kept to perform quantitative reverse transcriptase-polymerase chain reaction (QRT-PCR) analysis. cDNA was generated using 50 µg total RNA as a template and primed with PolyT(V)N (4.0 µg) and random hexamers (1.0 µg, Amersham, UK). cDNA was labeled with Cy5 or Cy3 dyes using the Cy-Scribe post-labeling kit (Amersham) as per the instructions. Labeled cDNA was hybridized to microarray slides printed with the CompuGen Mouse OligoLibrary (v2.0, 21,997 65-mers comprising 21,587 unique genes) by the Adelaide Microarray Facility (Australia). Slides were scanned using a GenePix 3000B scanner (Axon Instruments, Sunnydale, CA), and the Spot package (CSIRO, Australia) was used to identify spots and estimate fore- and background intensities (using a morphological opening background estimator) [43
, 44
]. Data analysis was performed in R (www.r-project.org) using the Limma package of Bioconductor [45
, 46
] and in-house scripts. Arrays were normalized using intensity-dependent spatial normalization and scale normalization [47
]. Spatial loess was also applied to a subset of arrays [48
]. Linear modeling and F-test-based classifications were performed with the Limma package of bioconductor [45
]. We estimated, effects for baseline (T0) differences between cell lines change over time in FI
(0, 72 h) and sample by time-interaction parameters between FI
and V449E and between FI
and GM-CSF. To examine the distribution of differentially expressed genes for a given comparison of interest, we constructed a volcano plot, in which we plotted log2(fold change) on the x-axis and the log10[false discovery rate (FDR)-adjusted P value] on the y-axis. This generates a volcano-like shape, in which genes at low fold changes are typically not differentially expressed (P values close to 1; log(p) approaching zero), and those with strong evidence for differential expression typically have high fold-change values. Gene ontology (GO) over-representation analysis was performed using GO-STAT using the CompuGen Library as the comparison set and applying a FDR P value adjustment [49
]. Promoter analysis was performed using the CUREOS database (http://cureos.wehi.edu.au) using Transfac matrix M00770 (V$CEBP_Q3, www.biobase.de) and requiring mouse and human homology. A
2test was used to test for over-representation of consensus-binding sites. Additional information may be found under Gene Expression Omnibus (GEO) Accession GSE3333. We have followed the guidelines set out by the Microarray Gene Expression Data Society (www.mged.org/miame).
Clustering of TFs
GO terms were obtained via the SOURCE website (http://source.stanford.edu, UniGene build 139) or via direct query of the National Center for Biotechnology Information gene database. To increase sensitivity further to potential TFs, we used homologene to extracted GO terms for human homologs. We selected all genes, which were children of the terms transcription (GO:0006350), nucleoplasm (GO:0005654), DNA binding (GO:003677), or transcription regulator activity (GO:0030258) or contained the terms "transcription," "histone," or "chromatin" in their GenBank Description. A total of 2338 genes was selected as transcription factor-associated. This was reduced down to 340 after filtering out genes with nonsignificant F-test values (1x105). Clustering was performed using the Diana algorithm (R cluster package). We evaluated the distance between two profiles using a measure based on that of Bar-Joseph [50
] for comparing two time profiles. Briefly, for a gene i, a smoothed spline curve C was fitted to each of the five time-course and matched time data sets. The distance between two genes (i and j) for time course p was calculated as
, and the total distance was then calculated as
. The dendrogram was then cut at a height of 1.25 to define 14 clusters. Full annotation information of genes in each cluster is listed in Supplementary Table 1.
QRT-PCR
Aliquots of RNA prepared for the microarray analysis were subjected to real-time RT-PCR analysis. For this, RNA was treated with DNase (Ambion, Austin, TX), reverse-transcribed with Oligo-dT (Ambion) using Omniscript RT (Qiagen). The sequences of the oligonucleotides used for PCR are listed in Supplementary Table 2. Gene-specific PCR reactions were performed for 38 cycles using Amplitaq Gold (Perkin Elmer, Wellesley, MA) and recommended conditions. SYBR green (10x; Molecular Probes, Eugene, OR) was added to a final concentration of 0.6x per reaction, which was performed on the Rotor-Gene 3000 and related software used for data collection and to determine mean expression values relative to Cyclophilin A (Corbett Research, Version 5.0). Amplification products were analyzed by melt curve and resolved on 2% agarose to confirm specificity. Analysis was performed using a mixed-effects linear model to estimate T0 differences between each cell line and mean changes within cell lines over time, treating PCR run as a blocking variable.
Retroviral transduction and functional analysis in the FDB1 cell line
For functional analysis, the full-length c-Myb cDNA was cloned into the murine stem cell virus (MSCV)-internal ribosome entry site (IRES)-green fluorescence protein (GFP) retroviral vector and FDB1 cells infected by cocultivation as described previously [40
]. Following infection with virus-producing cells, GFP+ FDB1 cells were isolated by flow cytometry and expanded in mIL-3. For testing, cells were withdrawn from IL-3 and monitored for morphological differentiation in the presence and absence of GM-CSF for 5 days, as described previously [40
]. Changes in cell morphology were recorded following cytocentrifugation of cells onto a glass slide and staining with May-Grünwald-Giemsa.
Association of the V449E gene set with leukemia subsets identified by gene profiling
We downloaded the (MAS5) normalized data used by Valk et al. [51
] from GEO (GSE1159-21979). The normalized data were loaded into R for analysis with the Limma package, and the patients were divided into the 16 clusters (groups) as defined in ref. [51
]. For each cluster, we compared expression in patients against normal controls, and for each gene obtained, FDR adjusted P value. We then mapped the V449E-associated gene set (see
Table 4
) to the probes on the HG-U133A GeneChip. After cross-species mapping, the V449E-associated data set reduced from 44 probes to 30. Then, for each of the 16 clusters identified in this study, we looked for association between our V449E gene set and genes differentially expressed in this cluster. We ranked all genes on the array on the basis of FDR-adjusted P values and then examined the distribution of ranks of the 30 Affymetrix probes homologous to our V449E gene set. Significance was assessed using a Wilcoxon rank sum test as implemented in R.
|
View this table: [in a new window] |
Table 1. Genes Associated with Granulocyte and Macrophage Differentiation (Full Annotation and Linear Modeling Results May Be Found in Supplementary Table 3)
|
|
View this table: [in a new window] |
Table 2. Known Myeloid Differentiation and Proliferation-Associated Genes1
|
|
View this table: [in a new window] |
Table 3. Genes Associated with Myeloid Cell Proliferation and Self-Renewal (Full Annotation and Linear Modeling Results May Be Found in Supplementary Table 4)
|
|
View this table: [in a new window] |
Table 4. V449E Down-Regulated Genes (Full Annotation and Linear Modeling Results May Be Found in Supplementary Table 6)
|
|
|
|---|
and V449E [40
]. A similar level of expression of the hßc mutant in each population was confirmed by staining for the FLAG epitope fused to the N terminus of the mutant ß subunit (Fig. 1A
). For each condition, changes in morphology, growth, and viability were quantitated (see Fig. 1B
and 1C
). The FDB1 cells behaved as described previously [34
, 40
], and complete differentiation to granulocytes and macrophages (approximately equal ratio) occurred over 5 days in GM-CSF and in response to the FI
signal. The switch from IL-3 to V449E signaling did not result in any discernible change to morphology, growth, or viability of the FDB1 cells.
![]() View larger version (51K): [in a new window] |
Figure 1. FDB1 experimental system. (A) Cell surface expression of the hßc-activated mutants on FDB1 cells as determined by flow cytometry. Transduced cells (solid peaks) were stained with an anti-FLAGTM monoclonal antibody and compared with identically stained, untransduced cells (open peaks). (B) Photomicrographs of cells induced to self-renew or differentiate over 5 days in the presence of the indicated growth factors or through the activity of the hßc-activated mutants. Original magnification, 400x. (C) Differential counts performed with the cell populations used for RNA preparation. Black, Blast cells; light gray, granulocytes; dark gray, monocytes; white, intermediately differentiated cells.
|
). This allowed a parallel time-course comparison to reveal gene-expression changes associated with the switch to alternative differentiation-inducing signals (GM-CSF or FI
) or signaling via the leukemia-inducing V449E mutant. In addition, we performed comparisons between cell populations at matched time-points to reveal differences in gene expression between cells undergoing differentiation or proliferation/self-renewal. The overall experimental design, incorporating matched-time and time-course comparisons (along with dye swaps and biological replicates), is shown schematically in Figure 2
and was based on admissible design criteria [52
]. A major strength of this design is that it allows estimation of genes with differential expression in the initial cell populations (i.e., T0 differences) and time-related changes within a cell population (main effects). It is important that this design also permits identification of genes with significantly different expression profiles between cell populations over time (interaction effects).
![]() View larger version (19K): [in a new window] |
Figure 2. Experimental design. Parental FDB1 cells grown in IL-3 were washed and cultured for up to 3 days in the presence of GM-CSF. In parallel, FDB1 cell populations expressing FI or V449E were washed and cultured in the absence of growth factor. RNA was harvested from these populations at 0, 6, 12, 24, 48, and 72 h. Comparisons performed are represented by arrows (double-headed arrow indicates the use of dye-swap comparisons). All comparisons presented were performed four times (i.e., two replicate cultures, two dye swaps, total slides 112). Differential cellular outcomes associated with each condition are indicated on the right. P, Proliferation; S, survival; D, differentiation; L, leukemic signaling.
|
and/or the GM-CSF cell population and for which this change was significantly greater than the change (if any) in the V449E cell population. The criteria used to select proliferation and self-renewal-associated genes were identical, except that we required decreases in gene expression rather than increases. Using an F-test with a FDR-adjusted P value cut-off of 1 x 105 and a minimum change of 1.4-fold, we identified 205 differentiation-associated genes and 175 proliferation or self-renewal-associated genes. These gene sets are represented in volcano plots in Figure 3
, which provide a visual indication of the relative evidence for differential expression of a given gene. Figure 3A
shows a clear increase in the number of genes displaying differential expression over time, based on FDR-adjusted P value, in the parental FDB1 population shifted to GM-CSF and the FI
population. Although few significant changes are observed at the 6-h time-point, clear groups of differentiation-associated (red) and proliferation-associated (green) genes are identified by this analysis over the 72-h time-course. To examine V449E signaling as a model of leukemic-activated cytokine receptor signaling, we compared the gene-expression profiles of the parental FDB1 cells and the FI
population (in IL-3) with the V449E cell population under the same conditions. Using an F-test with a FDR-adjusted P value cut-off of 1 x 105 and a minimum change of 1.4-fold, we identified 44 genes specifically down-regulated by the V449E mutant (shown in purple in Fig. 3B
). It is interesting that this gene set is strongly differentiation-associated, as indicated by the increased expression of these genes over time in the parental cells responding to GM-CSF and in the FI
population (Fig. 3A)
. This is most likely reflective of the ability of V449E to block differentiation of myeloid cells, a property that is crucial for induction of AML in vivo.
![]() View larger version (31K): [in a new window] |
Figure 3. Volcano plots of evidence for differential expression. Plot shows log10(FDR-adjusted P value) versus log2 (fold change). (A) Distribution of genes for each cell population at 6, 24, and 72 h (each compared with 0 h). (B) Baseline (0 h, IL-3) differences between each cell population. In each plot, the set of 205 differentiation-associated genes is represented by red points, the 175 proliferation and self-renewal-associated genes by green points, and the 44 genes specifically down-regulated in V449E at 0 h by purple points. The dashed gray line represents a FDR-adjusted P value cut-off of 0.01. We see that in FI and GM-CSF populations, many of the highlighted genes show early evidence of change, and changes become more significant over time. Typically, the genes down-regulated in V449E at T0 stay down-regulated at subsequent time-points, and they display increased expression over time in the FI and GM-CSF FDB1 populations (A).
|
![]() View larger version (44K): [in a new window] |
Figure 4. Comparisons of QRT-PCR and microarray data. For each gene, comparisons were made at 0, 24, and 72 h. In each plot, microarray results are represented by smoothed spline profiles with V449E by a green, dashed line; FI by a blue-dot, dashed line; and GM-CSF by a pink, long, dashed line. Baseline (T0) comparisons are represented by points as follows: cyan points, V449E compared with parental FDB1; purple squares, FI compared with parental FDB1. RT-PCR results are represented by points with bars representing 95% confidence intervals. Dark green circles, V449E; navy triangles, FI ; red diamonds, GM-CSF. Light blue circles, V449E compared with parental FDB1 at T0; dark purple circles, FI compared with parental FDB1. (A) Genes associated with GM differentiation; (B) genes associated with proliferation and self-renewal. Primer sequences used are presented (see Supplementary Table 5).
|
and/or the GM-CSF population and stable expression in the V449E population. Table 1
lists all 205 differentiation-associated genes together with their fold-changes and FDR P values over 72 h in the FI
cell population (changes in GM-CSF were similar and are supplied along with full gene annotation information in Supplementary Table 3). Example gene-expression profiles are shown in Figure 5A
. GO analysis of this gene set indicated significant over-representation of terms such as defense response (GO: 0006952, P=2x108), response to wounding (GO: 0009611, P=5x106), and chemotaxis (GO: 0006935, P=5x105), consistent with up-regulation of genes involved in mature granulocytes and macrophages. Many of the genes identified in this analysis are known to be associated with myeloid differentiation (Table 2
), thus providing an important validation of this cell-line model of myeloid differentiation and confirming the hypothesis that genes displaying concordant regulation in response to the FI
mutant and GM-CSF will be of most relevance to GM differentiation in vivo.
![]() View larger version (41K): [in a new window] |
Figure 5. Examples of proliferation and differentiation-associated gene-expression profiles. (A) Genes associated with GM differentiation. (B) Genes associated with proliferation and self-renewal. In each plot, points represent normalized microarray fold-change values (one point per array), and lines represent weighted, smoothed spline estimates of the change over 72 h. Upper plots: green circles and dashed line, V449E profile; blue triangles and a dot-dash line, FI profile; red squares and a solid line, V449E versus FI direct comparisons. Cyan points, V449E compared with parental FDB1 at T0 (i.e., in IL-3). Lower plots: blue triangles and a dot-dash line, FI ; pink diamonds and long, dashed line, GM-CSF; purple squares and a solid line, FI versus GM-CSF direct comparisons.
|
population or in cells responding to GM-CSF. Table 3
lists the 175 proliferation-associated genes (for full gene annotations, see Supplementary Table 4). Several of the genes in Table 3
have known ability to block differentiation or have a reported association with an immature hematopoietic cell phenotype (Table 2)
. Example gene-expression profiles are presented in Figure 5B
. In particular, the proto-oncogene c-Myb has known roles in regulation of myeloid cell proliferation (for review, see ref. [53
]), and the microarray profile for c-Myb was verified using Northern analysis (see Supplementary Fig. 1). As a representative of this class of genes and to test the capacity of the FDB1 cell line to respond to a known myeloid oncogene, we used a retroviral vector to enforce expression of c-Myb in FDB1 cells. Figure 6
shows that FDB1 cells infected with a MSCV-IRES-GFP retroviral vector expressing c-Myb are blocked in differentiation in the presence of GM-CSF. In addition, we have shown that these cells can be maintained in GM-CSF long-term, consistent with overexpression of c-Myb generating a complete block in myeloid differentiation. Although this is not unanticipated, given the transforming capacity of Myb [54
], nevertheless, it highlights the ability of our expression analysis to identify genes that function as hematopoietic regulators and the ability of such genes to perturb growth and differentiation in the FDB1 model system.
![]() View larger version (82K): [in a new window] |
Figure 6. Growth and differentiation effects of enforced c-Myb expression in FDB1 cells, which were transduced with a MSCV-IRES-GFP retrovirus encoding c-Myb or with control retrovirus and selected on the basis of GFP expression. Cells were washed to remove growth factor and then cultured in mIL-3 or GM-CSF for 5 days.
|
GO and divisive clustering to identify novel candidate myeloid transcriptional regulators
We next used this comprehensive data set to perform a level of gene prioritization aiming to highlight key genes with potential regulatory function. This process greatly facilitates the future functional analysis required to establish cause and effect for any genes of interest. With a view to identification of key myeloid regulators, we used GO terms to identify 340 TFs and TF-associated proteins displaying differential regulation consistent with a potential regulatory role (see Materials and Methods for details). Proteins in these categories can influence cell fate and the switch between proliferation and differentiation [55
] and are often involved in leukemic translocations resulting in fusion proteins that have the capacity to reprogram myeloid cell behavior [1
]. As TFs with regulatory roles in hematopoiesis often have well-defined patterns of expression, we used divisive, hierarchical clustering on gene-expression profiles under all conditions to group together genes that display coordinate regulation. The clustering approach identified eight clusters (237 genes) associated with proliferation (down-regulated during differentiation) and six clusters (103 genes) associated with differentiation (see Supplementary Table 1 for full gene annotation information). Within these divisions, clusters grouped together genes that had a similar rate or magnitude of gene-expression change. A heat map showing all clusters is presented in Figure 7
. To select genes of particular interest from this group, we focused on genes that clustered with genes known to have roles in myelopoiesis or leukemogenesis. This information was taken together with evidence available from the literature (Entrez PubMed, iHOP [56
]) to generate a group of genes with a high probability of having regulatory capacity in the myeloid lineage. These genes are discussed briefly below. Relevant references are summarized in Supplementary Table 5.
![]() View larger version (75K): [in a new window] |
Figure 7. Heat map displaying TF-associated clusters. Heat map showing mean expression level for each comparison performed for 340 genes, significantly, differentially expressed at 0, 6, or 72 h. These genes were identified as TF-associated, based on GO classification or gene description. The tree obtained from divisive clustering is presented on the left-hand side of the plot. Fourteen clusters were defined by cutting the tree at a height of 1.25 and are indicated using the colored bands on the left-hand side. Numbering of the clusters starts from the bottom (red) cluster and proceeds upward (Cluster 6, dark blue; Cluster 9, chartreuse; Cluster 10, medium blue; Cluster 12, light blue). The 340 rows represent individual probes on the array, and the 28 columns represent the points in the time-course analysis. Heat map values are the expression values estimated using a linear modeling approach scaled across rows, and red indicates up-regulation, and green indicates down-regulation. The size of each band gives an indication of the magnitude of the change, and color gives an indication of the direction and magnitude of the change. (Full annotation details may be found in Supplementary Table 1.) Top horizontal band indicates the microarray comparison (V/F indicates matched time comparisons of the V449E cell population against the FI population; G/F indicates matched time comparison of the GM-CSF treated cells against the FI cell population; IL-3 indicates the comparison of the V449E cell population with parental cells in IL-3). Bottom band indicated the time point.
|
|
|
|---|
Novel regulators of myeloid differentiation
We also studied a number of transcription-associated factors, which display increased levels of expression, concomitant with growth arrest and onset of morphological differentiation. These have the potential to be important initiators of myeloid differentiation. Clusters 9 and 10 were closely related and contained genes with gradual increases in expression over time in cells undergoing GM differentiation in the GM-CSF and FI
conditions. These clusters included genes known to be involved in inducing myeloid differentiation such as PU.1 [4
],Notch1 [67
],Cdkn2a [68
, 69
], and Egr2 [70
]. As would be predicted for this class of genes, a number of them (e.g., PU.1 and Cdkn2a) can act as myeloid tumor suppressors. Of novel interest to the myeloid system was the circadian rhythm gene, Per2 [71
], which functions as a tumor suppressor. It is important that a recent study identified Per2 as a C/EBP
target gene in myeloid cell lines and showed that enforced Per2 expression induced tumor suppressor activity in leukemic cell lines [72
]. Other genes of interest in Clusters 9 and 10 are Btg2 [73
, 74
] and Tcf7l2 [75
].Btg2 is an antiproliferative, p53-dependent transcriptional coregulator, which may also be a target of c-Myb repression in myeloid cells. The increase in Btg2 gene expression was also confirmed by QRT-PCR (Fig. 4)
.
Genes in Cluster 12 generally displayed early and large increases in gene expression (especially in FI
-induced differentiation). This cluster contained six genes, and of these, Fos [76
], Egr1 [55
], and Hlx1 [77
] are known inducers of myeloid differentiation, making the other three genes (Egr3 [78
], Klf5 [79
, 80
], and an uncharacterized expressed sequence tag clone) of potential interest. Egr3 has not been reported to be important in the myeloid lineage but has known roles in the T cell lineage. Klf5 has been reported to be a direct target of C/EBP
in myeloid cells and is involved in the C/EBP cascade that regulates adipocyte differentiation [80
]. The transcriptional regulation of Klf5 was confirmed by QRT-PCR and can be seen in Figure 4
. A key to understanding the role of these potential novel regulators in the switch between myeloid proliferation and differentiation will be functional analysis approaches in the FDB1 cell line as described above, prior to studies in primary cell and in vivo systems.
A common gene-expression signature between the V449E-activated mutant and human AML: involvement of C/EBP
Given the selective leukemogenic capacity of the V449E mutant, we wished to identify gene-expression patterns associated specifically with signaling from this activated mutant. As shown in Figure 3B
, there are a striking number of gene-expression differences unique to the V449E cell population. In particular, a total of 44 genes was down-regulated 1.4-fold or more (maximum, eightfold) in the V449E population, relative to the FI
and parental populations in IL-3 (F-test P value 1x105). Fold changes for these genes relative to parental FDB1 cells in IL-3 are shown in Table 4
(for full gene annotations, see Supplementary Table 6). Closer inspection of the V449E-down-regulated gene list revealed that 10 of the 44 genes are known C/EBP
and/or C/EBP
target genes (marked with asterisks in Table 4
). To investigate this potential link with C/EBP activity further, we examined promoter regions of this gene set using the Cureos TF/promoter database (http://cureos.wehi.edu.au). A total of 15 genes contained high-quality C/EBP consensus-binding sequences conserved between mouse and human matches (Transfac matrix V$CEBP_Q3). Given that 4706 of the 17,731 unique genes on the array contained consensus sites, this suggests an over-representation of CEBP-binding sites (
2 test P value of 0.25). To investigate this link further, we generated a larger gene set by relaxing the requirement that genes must be down-regulated by more than 1.4-fold compared with the parental and FI
population in IL-3 (but retaining the F-test, P value of 1x105). Using this gene set, we observed that 80 of the 203 down-regulated genes contained consensus-binding sites, which represents a significant enhancement (
2 test P value of 3x105). This suggests that the V449E down-regulated group of genes may contain many novel C/EBP targets in addition to those known targets already identified, and it suggests a model in which V449E signaling affects C/EBP activity, which contributes to a block in differentiation of FDB1 cells [34
] and contributing to AML in vivo.
To test whether this V449E-specific gene signature was related to the changes observed in primary human leukemias, we compared the V449E gene set (Table 4)
with a number of AML signatures identified using gene-expression profiling [51
]. For this, we based our analysis on the gene set-enrichment approach of Mootha et al. [87
] but adapted the approach for across the microarray platform use by using the Wilcoxon Rank Sum tests. This analysis revealed a highly significant association (P=0.0006) with a specific AML cluster (Cluster 4 in Valk et al. [51
]; Table 5
). Eight of the 15 leukemias in this cluster carried mutations in the C/EBP
gene, suggesting that a loss of C/EBP
activity is central to this leukemic phenotype. Taken together, these findings provide further support for a model in which V449E signaling affects C/EBP activity, which is important for is leukemic properties. The FDB1 cell line model provides a system with manipulability, which can now be used to study the mechanism of dysregulation of C/EBP
by activated receptor signaling and the target genes that may be central to AML pathogenesis.
|
View this table: [in a new window] |
Table 5. Association of the V449E-Associated Gene Set with AML Subtypes Defined by Valk et al. [51
]
|
Received February 23, 2006; revised March 20, 2006; accepted March 23, 2006.
|
|
|---|
is a regulatory switch sufficient for induction of granulocytic development from bipotential myeloid progenitors Mol. Cell. Biol. 18,4301-4314
subunit Blood 95,120-127
and steroidogenic factor 1 transcription factors Mol. Cell. Biol. 23,5366-5375
activity alters gene expression and differentiation of human CD34+ cells Blood 101,2206-2214
Nucleic Acids Res. 26,3034-3043
regulate the monocytic/myeloic gene MRP14 Immunobiology 199,148-151[Medline]
Blood 97,2561-2567
and
(C/EBP
and C/EBP
) and CDP/cut in myeloid maturation-induced lactoferrin gene expression Blood 101,3460-3468
isoforms regulate transcription of the gene encoding eosinophil granule major basic protein J. Biol. Chem. 277,43481-43494
-mediated cell cycle arrest during granulocytic differentiation Blood 103,1286-1295
-responsive genes involved in oxidative phosphorylation are coordinately downregulated in human diabetes Nat. Genet. 34,267-273[CrossRef][Medline]
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||