Embryonic stem cells (ESCs) exist in at least two states that transcriptionally resemble different stages of embryonic development. Naïve ESCs resemble peri-implantation stages and primed ESCs the pre-gastrulation epiblast. In mouse, primed ESCs give rise to definitive endoderm in response to the pathways downstream of Nodal and Wnt signalling. However, when these pathways are activated in naïve ESCs, they differentiate to a cell type resembling early primitive endoderm (PrE), the blastocyst-stage progenitor of the extra-embryonic endoderm. Here, we apply this context dependency to human ESCs, showing that activation of Nodal and Wnt signalling drives the differentiation of naïve pluripotent cells toward extra-embryonic PrE, or hypoblast, and these can be expanded as an in vitro model for naïve extra-embryonic endoderm (nEnd). Consistent with observations made in mouse, human PrE differentiation is dependent on FGF signalling in vitro, and we show that, by inhibiting FGF receptor signalling, we can simplify naïve pluripotent culture conditions, such that the inhibitor requirements closer resemble those used in mouse. The expandable nEnd cultures reported here represent stable extra-embryonic endoderm, or human hypoblast, cell lines.
Embryonic stem cells (ESCs) are immortal, karyotypically normal cells derived from the mammalian pre-implantation embryo (reviewed by Smith, 2001). They are defined as pluripotent, as they can differentiate into all lineages of the future conceptus, and have been derived from a number of species, including humans (Thomson et al., 1998). At the stage of traditional ESC derivation, the pre-implantation embryo is in the process of generating three lineages as a result of two successive lineage segregation events. The first of these occurs as a result of morula compaction, leading to formation of the extra-embryonic trophoblast on the outside and the inner cell mass (ICM) on the inside. The blastocyst is then formed following cavitation, and ICM cells differentiate to either pluripotent epiblast (EPI) or the bipotent extra-embryonic primitive endoderm (PrE), also referred to as hypoblast in non-murine species.
ESCs are characterised by expression of a set of transcription factors (TFs) collectively referred to as the pluripotency network, including NANOG, OCT4 (also known as POU5F1) and SOX2, that are widely expressed in both the pre-segregation ICM and in the EPI (Dietrich and Hiiragi, 2007; reviewed by Chambers and Tomlinson, 2009; Morgani and Brickman, 2015; Martin Gonzalez et al., 2016). In addition to the pluripotency network, TFs specific to other lineages of the pre-implantation embryo include CDX2 in the trophoblast, and GATA6 and GATA4 in the nascent PrE (reviewed by Rossant, 2018). In mouse, the segregation of PrE and EPI from the ICM is regulated by FGF/ERK signalling, and the inhibition of this pathway can be used to support ESC self-renewal (Burdon et al., 1999; Chazaud et al., 2006; Yamanaka et al., 2010). This observation has been crucial for the development of defined conditions to derive and maintain ESCs in what has been referred to as the ground state of pluripotency, using a combination of inhibitors of MEK1 (MAP2K1) and MEK2 (MAP2K2) [both inhibited by PD0325901 (PD03)] and GSK-3 [inhibited by CHIR99021 (CHIR)], along with the cytokine LIF (2i/LIF) (Ying et al., 2008).
In contrast to mouse, conventional human ESCs (hESCs) differ in their culture requirements and instead exploit FGF/ERK activity to maintain pluripotency. An explanation for these differences has been that these in vitro cell types approximate different stages of embryonic development, and that the components required to support hESCs are instead required for pluripotency at later developmental stages in the mouse (Brons et al., 2007; Tesar et al., 2007). Mouse cell lines derived in human ESC conditions no longer bear resemblance to the peri-implantation embryo, but instead exhibit a gene expression signature and morphology similar to their human counterpart and the pre-gastrulation stages of embryonic development. These have therefore been termed EPI-derived stem cells (EpiSCs). The existence of two distinct cell types indicates that pluripotency exists across several developmental stages (reviewed by Nichols and Smith, 2012). Cells maintained in ground state conditions are considered naïve, whereas cells cultured in the presence of FGF represent a more advanced stage of development, in which cells are ‘primed’ for differentiation. A number of recent reports have identified culture conditions that capture a more naïve phase of human pluripotency, all of these conditions include 2i/LIF, but vary in the set of additional factors added to this media (Gafni et al., 2013; Takashima et al., 2014; Theunissen et al., 2014; Ware et al., 2014).
In mouse, immortal stem cell lines capable of indefinite self-renewal have been derived from the extra-embryonic, as well as the embryonic, lineages. Trophoblast stem (TS) cells can be derived from the polar trophectoderm and extra-embryonic endoderm (XEN) cells from the PrE (Tanaka et al., 1998; Kunath et al., 2005; Niakan et al., 2013). However, although derived from the blastocyst, these culture systems share characteristics of the later, more restricted extra-embryonic lineages, as TS cells resemble the extra-embryonic ectoderm and XEN cells resemble the parietal endoderm (Tanaka et al., 1998; Kunath et al., 2005). We recently demonstrated that in the presence of activin A (ActA) and CHIR, mouse naïve ESCs robustly differentiate to bipotent PrE that can be expanded as an in vitro naïve extra-embryonic endodermal progenitor (nEnd), capable of contributing to both the visceral and parietal endoderm (Anderson et al., 2017). Although evidence does exist for extra-embryonic endoderm differentiation in human models, none of these studies has produced expandable cell lines in the absence of genetic manipulation or abnormality. SOX7 overexpression in primed hESCs induced extra-embryonic endoderm progenitors that could maintain their phenotype through expansion (Séguin et al., 2008), and cell lines derived from human testicular teratomas have been reported to have either visceral or parietal characteristics (Pera et al., 1987). Furthermore, addition of retinoic acid to human embryonal carcinoma cells induced extra-embryonic endodermal gene expression (Roach et al., 1994). In hESCs, protein kinase C (PKC) stimulated differentiation to an endodermal cell type that undergoes epithelial-to-mesenchymal transition (EMT) and expresses markers of the extra-embryonic parietal endoderm (Feng et al., 2012). More recently, single-cell analysis of primed hESCs revealed a population of pluripotent founder cells that resemble extra-embryonic endoderm (Nakanishi et al., 2019). Taken together, although it appears to be possible to generate PrE cell types in vitro, no genetically stable self-renewing cell line or culture system has been reported for human PrE, or hypoblast.
In mammals, endoderm differentiation occurs in two waves, an early extra-embryonic wave giving rise to the PrE, and a later, gastrulation-stage differentiation resulting in the embryonic definitive endoderm (DE). Although these tissues share expression of several markers, they have traditionally been viewed as having different fates, with PrE contributing to the extra-embryonic visceral yolk sac and the majority of the embryonic organs coming from the DE (Lawson et al., 1987). This notion has recently been challenged by fate mapping and single-cell transcriptomics, asserting that mouse PrE can contribute to both the embryonic and extra-embryonic lineages (Kwon et al., 2008; Chan et al., 2019; Nowotschin et al., 2019; Peng et al., 2019; Pijuan-Sala et al., 2019). Moreover, although the induction of these two lineages were seen as modulated by distinct signalling pathways, we have previously shown that, in mouse, endoderm induction is context dependent. Differentiation is mediated by the same pathways acting on distinct developmental cell types, with signalling downstream of Wnt and Nodal inducing PrE from naïve ESCs, and gastrulation-stage DE from EpiSCs (Anderson et al., 2017). Together, these findings suggest that endoderm differentiation in mammals occurs based on common inductive mechanisms acting over a range of developmental stages, in which the stage determines the precise nature and fate of the endoderm induced.
Here, we ask whether context-dependent endoderm induction is evolutionarily conserved, by testing whether stimulation of Wnt and Nodal-related TGF-β signalling can give rise to distinct endodermal cell types depending on the starting population of pluripotent hESCs. We found that addition of ActA and CHIR to naïve and primed hESCs produced PrE and DE, respectively. As in the mouse, we also observed that FGF signalling is required for PrE differentiation in vitro and that we could exploit the activity of FGF signalling in PrE differentiation as a means to improve naïve hESC culture. Finally, we are able to expand PrE as nEnd, to produce stable human extra-embryonic endoderm cell lines resembling the in vivo hypoblast.
Naïve human pluripotent cells produce primitive endoderm in response to Wnt and nodal-related TGF-beta signalling
To investigate the context-dependent nature of the endoderm fate in the mouse, we previously used a fluorescent reporter mouse ESC line for the endodermal markers Hhex and Hnf4a, in which the order of induction of these two genes could distinguish between PrE and DE differentiation (Anderson et al., 2017). However, recent single-cell transcriptomic studies of the mouse and human embryo demonstrate that the role of HHEX in human PrE is not conserved and that HHEX is uniquely expressed in human DE, whereas FOXA2 is expressed in both lineages (Fig. S1A) (D'Amour et al., 2005; Blakeley et al., 2015; Boroviak et al., 2018). To determine whether the same context dependence of embryonic versus extra-embryonic endoderm induction is also true in human, we generated a double fluorescent reporter for HHEX and FOXA2 in H9 hESCs (H9-HF). This was achieved by CRISPR/Cas9-mediated homologous recombination targeting H2B-mCherry to the first exon of HHEX, and replacing the endogenous stop codon of FOXA2 with an in-frame fusion to a linker, followed by the mVenus coding sequence (Fig. S1B-F). To understand the relationship between differentiation and proliferation in PrE specification, we also generated a cell cycle H2B-mCherry-F2A-mVenus-hGem(1-110) (H9-G2M) reporter by random integration (Fig. S1G-I) in H9 hESCs. All reporter cell lines were generated using primed hESCs cultured in FGF/knockout serum replacement (KSR) medium.
To generate paired versions of primed and naïve cell lines, primed H9 wild-type and reporter cell lines were chemically reset (cR) following transient exposure to a histone deacetylase inhibitor (Guo et al., 2017). These were maintained as naïve hESCs in t2iLGö (naïvet2iLGö) (Takashima et al., 2014) (Fig. S2A,B). Based on our analysis of published transcriptomic data of cR and in vivo-derived naïve hESCs, we identified CD53 as a putative cell-surface marker of human naïve pluripotency (Takashima et al., 2014; Guo et al., 2017). Whereas primed H9 expressed high levels of SSEA4, chemically reset H9 (cR-H9) and H9-HF (cR-H9-HF) cells expressed CD53, but not SSEA4 (Fig. S2C,D). This expression pattern was also observed in the embryo-derived human naïve ESCs (HNES1) (Guo et al., 2016) (Fig. S2C). We confirmed the identity of cR cell lines by morphology, immunofluorescence and qRT-PCR analysis of naïve-specific markers, such as KLF17, DPPA3 and TFCP2L1, and the ubiquitous pluripotency markers SOX2 and NANOG (Fig. S2E-G).
We asked whether the primed and naïve H9-HF reporter ESCs could differentiate toward distinct endodermal subtypes in response to Wnt and Nodal signalling (Fig. 1A). In parallel, the cell lines were exposed to ActA, CHIR and LIF in RPMI 1640 GlutaMAX base medium supplemented with B27 minus insulin (RACL) (see Materials and Methods). Following 4-5 days of differentiation, primed H9-HF gave rise to a HHEX-mCherry/FOXA2-mVenus double-positive population, whereas up to 7 days of differentiation with cR-H9-HF did not, suggesting that only primed cells were able to generate DE (Fig. 1B,C). This was consistent with qRT-PCR analysis, which demonstrated the upregulation of various DE markers, such as MIXL1, CXCR4 and GSC in RACL-treated primed H9-HF, but not in cR-H9-HF (Fig. 1D). In contrast, the figure also shows that RACL-treated cR-H9-HF cells expressed high levels of the early PrE marker PDGFRA (Artus et al., 2010), and the PrE marker NID2 identified from single-cell sequencing data of the human blastocyst (Petropoulos et al., 2016). These data, and others (Blakeley et al., 2015), also suggest BMP6 as a potential marker of human PrE. However, we observed BMP6 mRNA to be expressed in both endodermal cultures, as were the endodermal TFs GATA6 and GATA4 (Fig. 1D). When the transcriptome of in vitro-derived human PrE was directly compared with DE, we observed considerable differences between these cell types, with 3457 genes upregulated and 3294 downregulated. As some of these differences can be attributed to developmental stage (i.e. primed or naïve), we filtered these datasets for genes that were also differentially expressed between the two hESC populations. From this we identified 722 and 845 genes uniquely up- and downregulated in PrE compared with DE, respectively, further suggesting that these are two distinct in vitro endodermal populations (Fig. 1E, Table S1A-D).
To probe the kinetics of endoderm differentiation from naïve hESCs, we assessed the timing of NANOG and GATA6 expression by immunofluorescence. Although only NANOG+ cells were observed during the first few days of differentiation, a transient NANOG/GATA6 double-positive population emerged between days 3 and 4, and these cells eventually resolved their identity, with a GATA6 single-positive population arising from day 5 onwards. By day 7, all NANOG-expressing cells were lost and the culture was comprised entirely of GATA6+ putative PrE colonies (Fig. 2A). Similar expression kinetics were observed by qRT-PCR (Fig. 2B). Pluripotency genes, such as NANOG, DPPA3 and OCT4, were downregulated by day 4 of differentiation, and panendodermal genes GATA6 and GATA4 were upregulated from day 3. Specifically, we observed an initial upregulation of GATA6 at day 2 followed by GATA4 from day 3, mirroring the in vivo sequence of TF activation observed during mouse pre-implantation development (Yamanaka et al., 2010). We also observed an upregulation of the human PrE-specific marker NID2 (Fig. 2B). Whole-transcriptome analysis revealed that a number of genes specifically upregulated in PrE compared with EPI in vivo, are upregulated in PrE derived from naïve hESCs (Yan et al., 2013; Blakeley et al., 2015) (Fig. S3A). The derivation of PrE from naïve hESCs was corroborated by similar experiments performed with two other naïve hESC lines, HNES1 and cR-Shef6 (Takashima et al., 2014; Guo et al., 2017) based on morphology (Fig. 2C), gene expression (Fig. 2D) and flow cytometry for PDGFRA (Fig. S3B).
Laminin-511 supports feeder-independent differentiation of PrE
A key difference between mouse and human naïve ESC cultures is the substrates on which they are propagated. Naïve hESCs are typically cultured on feeders, whereas mouse ESCs (mESCs) are cultured in feeder-free conditions, primarily on gelatine. Although naïve mESCs were originally cultured on feeders, the factor produced by these feeders that was sufficient to support pluripotency was identified as LIF (Smith et al., 1988). We therefore asked whether PrE differentiation could proceed in the absence of feeders.
Based on the previous observations that naïve hESCs could be stably maintained on a truncated laminin-511 fragment (LN511-E8) (Takashima et al., 2014; Guo et al., 2016), we attempted to culture these on the commercially available full-length laminin-511 (LN511, BioLamina). We confirmed that cR-H9 maintained their naïve phenotype by assessing morphology (Fig. S4A), CD53 expression (Fig. S4B) and their transcriptome (Fig. S4C, Table S2A-B), from which we confirmed individual naïve markers (Fig. S4D). As we found LIF to be essential for PrE specification in mouse (Morgani and Brickman, 2015; Anderson et al., 2017), we rationalised the continued use of the cytokine in human differentiation, and based on the transcriptome of the resulting cultures, PrE differentiation appeared to be feeder independent (Fig. S4E-G). Principal component analysis (PCA) of these datasets highlights the similarities in gene expression profiles of cells cultured and differentiated on LN511 or mouse embryonic fibroblasts (MEFs). PC1 separates naïve pluripotent cells from PrE (80%), whereas the small amount of variance in PC2 resolves any differences resulting from the different substrates (6%, accounting for a variation of <130 genes in both states) (Fig. S4E). Furthermore, cells differentiated to PrE on LN511 maintained similar levels of expression of PrE lineage markers to those differentiated on MEFs (Fig. S4G, Table S2C-D).
FGF signalling is required for PrE differentiation in vitro.
Although the role of FGF/ERK signalling in PrE specification is well established in the mouse, both in vitro and in vivo, its role in primates has remained unclear (Arman et al., 1998; Nichols and Smith, 2009; Yamanaka et al., 2010; Kuijk et al., 2012; Roode et al., 2012; Anderson et al., 2017). Based on culturing human blastocysts ex vivo, FGF signalling does not appear to be required for PrE differentiation. Treatment of embryos with inhibitors of FGF signalling [MEK1/2 inhibitor PD03 and FGFR1 inhibitor PD173074 (PD17)] does not affect the emergence of mutually exclusive NANOG- and GATA4-positive cells, with the hypoblast forming normally (Roode et al., 2012). However, as PD03 is included in most formulations of human naïve ground state conditions, coupled with its function in the mESCs to block PrE differentiation (Hamilton and Brickman, 2014), we reasoned that it may serve as an inhibitor of PrE differentiation in vitro.
We asked whether FGF signalling was required for specification of PrE from naïve hESCs in vitro in the presence of RACL. We found that both 1 µM PD03 and 1 µM PD17, when applied individually or combined, reproducibly blocked differentiation of cR-H9 to PrE (Fig. 3A-F). After 7 days with inhibitors, cells failed to upregulate GATA6 and GATA4 or downregulate NANOG (Fig. 3C,D). We observed that both PD03 and PD17 attenuated the upregulation of PDGFRA, and that although PD03 significantly upregulated CD53, both conditions with PD17 produced further increases in naïve surface marker expression (Fig. 3E). Based on qRT-PCR, inhibition of FGFR1 and MEK1/2 resulted in increased expression of NANOG, OCT4 and KLF17, and blocked PrE gene expression (Fig. 3F). Although the inclusion of PD03 and PD17 to RACL resulted in a high level of cell death (Fig. 3B), the morphology of the remaining colonies appeared to be naïve-like (Fig. 3A).
FGFR1 inhibition supports naïve pluripotency in the absence of an atypical PKC inhibitor
Although the culture conditions for both mouse (2i/LIF) and human (t2iLGö) naïve ESCs are remarkably similar, with both sharing four key components (N2B27 supplemented with CHIR, PD03 and LIF), one notable difference is the addition of the broad-spectrum PKC inhibitor Gö6983 (Gö). Gö is reported to produce strong autofluorescence from bright red aggregates both in vivo and in vitro, making the use of fluorescent reporters challenging (Saiz et al., 2013; Takashima et al., 2014; Guo et al., 2017). We also observed this by fluorescent microscopy and flow cytometry analysis (Fig. S5A,B).
We attempted short-term culture of cR-H9 in the absence of Gö as a means to improve imaging, but found that these cells lost their naïve morphology, surface marker expression, and showed reduced expression of the naïve markers NANOG, OCT4 and KLF17 (Fig. 4A-C). As the knockdown phenotype of the atypical PKC (aPKC) isoform iota/lambda (aPKCι/λ) (Takashima et al., 2014) demonstrated its role in supporting naïve pluripotency, we asked whether we could substitute Gö for another inhibitor targeting aPKC, including CRT0066854, PKC412 and ZIP. However, at all tested concentrations, hESCs lost their naïve morphology, similar to cells cultured without Gö, and some inhibitors caused cell death at high concentrations (Fig. S5C).
Based on our observation that 1 µM PD17 could block PrE differentiation in vitro, and that surviving cells in this condition retained a naïve-like morphology and high levels of CD53, we asked whether the FGFR1 inhibitor could replace Gö in naïve hESC culture. We cultured cR-H9 in N2B27 supplemented with 1 µM PD17, 1 µM CHIR and 10 ng/ml LIF (human-formulated 2iL, h2iL). Under these conditions, we found that hESCs could maintain their naïve morphology (Fig. 4A), CD53 expression (Fig. 4B), and differentiate to PrE (Fig. 4D). Importantly, cells cultured in h2iL (naïveh2iL) lost any Gö-derived autofluorescence within 3 passages (Fig. S5A,B).
To confirm that these h2iL cells were indeed naïve, we assessed their transcriptomes and compared them with cells cultured in t2iLGö, along with PrE derived from both culture conditions by RNA-seq. Based on unbiased hierarchical clustering, we observed a strong correlation between naïveh2iL and naïvet2iLGö, and between the h2iL- and t2iLGö-derived PrE (PrEh2iL and PrEt2ilGö, respectively) (Fig. 4E). When differentiated PrE gene expression was compared with both naïve states, these differentiation-specific changes were strongly correlated (Fig. S5D). PCA comparing the two naïve culture conditions with endoderm derived from these, in addition to primed pluripotent cells, suggests that the transcriptomes of naïveh2iL and naïvet2iLGö, and PrEh2iL and PrEt2iLGö are strikingly similar (Fig. 4F). These observations were also supported by differential gene expression analysis between naïveh2iL and naïvet2iLGö that revealed 40 upregulated and 64 downregulated genes (Fig. 4G, Table S3A,B). Gene ontology (GO) analysis of this small set of differentially expressed genes revealed enrichment in specific terms associated with extracellular matrix components. Similar observations were made when comparing PrEh2iL and PrEt2iLGö (Fig. S5E; Table S3C,D). When inspecting normalised expression for a panel of pluripotency and extra-embryonic endoderm genes, we found no overall difference between the two naïve ESC conditions (Fig. 4H). When placing these cells back into primed culture, they readily converted to conventional hESCs expressing high levels of SSEA4 (Fig. S5F,G). As a relatively high concentration of PD17 (1 µM) was required (Fig. S5H), we considered a possible interaction with the related VEGF and EGF receptors (VEGFR and EGFR, respectively). We tested the ability of specific inhibitors of VEGFR (CBO-P11) (Deshayes et al., 2011) and EGFR (PD153035) (Bos et al., 1997) to mimic the activity of PD17 and found that neither was sufficient to maintain naïve pluripotency (Fig. S5I,J). Based on this observation, PD17 does not appear to be acting on lower affinity targets. Thus, together with LIF and GSK-3 inhibition, PD17 supports naïve hESC self-renewal, indicating that the underlying mechanisms governing naïve pluripotency may be better conserved than has previously been thought.
Human primitive endoderm can be expanded as an in vitro model for naïve extra-embryonic endoderm
In mouse, we recently identified conditions that support self-renewal in extra-embryonic PrE and that this was based on the activity of ActA, CHIR and LIF. As these cells can contribute to both parietal and visceral endoderm in vivo, we referred to these cultures as nEnd (Anderson et al., 2017). Here, following differentiation of the cR-H9-G2M reporter to PrE in RACL, we observed that these cultures contained proliferating cells, indicated by green fluorescing G2/M cells co-expressing GATA6 (Fig. S6A,B). However, the rate of cell death was significant and exceeded growth upon passaging, such that the culture expired in 3 passages (Fig. S6C,D). As mouse nEnd contains high levels of insulin-responsive AKT activation, including phosphorylations that are known responses to the PI3K and mTOR pathways (Anderson et al., 2017), we asked whether we could improve human nEnd survival by supplementing insulin to RACL. Here, we found that PrE cultures expanded in a dose-dependent manner to insulin (Fig. S6E). We recently reported that other components of the insulin-rich N2B27 medium also supported expansion of naïve mESCs in a similar cytokine cocktail to that used in PrE differentiation (Anderson et al., 2017). We therefore also assessed replacing the RACL basal media with N2B27 [supplemented with ActA, CHIR and LIF (NACL)] and found that these cells could be maintained for more than 13 passages (Fig. 5A). During PrE differentiation, flow cytometry analysis revealed a population expressing GATA6 that was heterogeneous with respect to PDGFRA. This resolves in expansion as an exclusive double-positive nEnd population (Fig. 5B). These consistently expressed PrE markers (Fig. 5C, Fig. S6F) and immunofluorescence demonstrated that these cultures are GATA6+/NANOG− (Fig. 5D). This is consistent with the observed uniform morphology of nEnd, in which ESCs do not re-emerge at any point during expansion. Following an initial passaging ratio of 1:2, nEnd cultures were routinely passaged 1:4 for 13 passages, which constitutes a 3×107-fold expansion. Therefore, as in mouse, human nEnd can be expanded as an in vitro model for extra-embryonic endoderm development. Our approach to establishing this culture system is summarised in Fig. 5E, with an overview of passaging techniques and media compositions given in Fig. S6G,H. Although FGF/ERK signalling is required for PrE differentiation in vitro, it is not required for self-renewal of nEnd (Fig. S6I,J). This suggests that the role of FGF/ERK inhibition in suppressing differentiation is not based on a selective mechanism.
During early mammalian development, PrE and EPI precursors within the ICM undergo cell sorting. The PrE is the first of the two cell types to become polarised, forming an epithelialised sheet that produces basement membrane (BM) components, facilitating subsequent embryonic development (Koutsourakis et al., 1999; Smyth et al., 1999; Gerbe et al., 2008; Schrode et al., 2014). To determine the functional capacity of nEnd, we assessed BM production. Here, we found that nEnd produces BM factors normally expressed in both human and marmoset hypoblast in vivo (Blakeley et al., 2015; Boroviak et al., 2018), including FN1, LAMA1 and COL4A1, all of which are notably absent in naïve hESCs (Fig. 5F,G). In support of this observation, we also found that expression of mRNA encoding a range of additional hypoblast-associated BM components was upregulated in nEnd relative to naïve hESCs (Fig. S6K).
In mouse, XEN cells can be converted to a visceral endoderm (VE)-like cell type through culture on laminin- or gelatine-coated substrates in the presence of BMP4 (Artus et al., 2012; Paca et al., 2012). We previously reported similar observations when applying these conditions to mouse nEnd (Anderson et al., 2017). We therefore assessed the ability of human nEnd to generate VE by culturing these cells on a range of substrates (Fig. 5H). nEnd cultured on LN511 in N2B27 supplemented with 50 ng/ml BMP4 resulted in the downregulation of GATA6 and PDGFRA, upregulation of VE markers AFP and HNF4A, and suppression of basal levels of the parietal marker PLAT. We observed an increase in a second parietal marker, SPARC, but this has also been reported in the VE in vivo (Mason et al., 1986).
Transcriptomic states of naïve epiblast and endoderm
As functional tests of PrE identity in human are ethically restricted, we sought to validate our cell culture model based on in vivo gene expression from the human blastocyst. We utilised a panel of EPI- and PrE-specific markers defined from single-cell transcriptomic analysis of the human blastocyst (Table S4A) (Stirparo et al., 2018). Here, we included the transcriptomes of naïve and primed hESCs, alongside putative PrE. We found that genes specifically associated with EPI in the human blastocyst also exhibit a strong correlation with naïve hESCs, whereas those associated with endoderm show the highest level of correlation with PrE and nEnd (Fig. 6A).
To understand the relative contribution of these EPI and PrE-specific markers in separating ESCs from different endoderm subtypes (i.e. PrE versus DE), we performed PCA on all samples using only these predetermined lineage markers (Fig. 6B). This analysis included in vitro-differentiated PrE, nEnd and DE, as well as DE sorted for prospective foregut identity (based on CXCR4 and KIT expression), referred to as anterior definitive endoderm (ADE) (Morrison et al., 2008; Cheng et al., 2012). We found that these markers were sufficient to distinguish not only ESCs from endoderm, as represented by PC1, but also in vitro-derived PrE and DE in PC2. When including a panel of genes specific to naïve or primed hESCs (Table S4B) (Messmer et al., 2019), in addition to the previous set of markers from the human blastocyst, we observed a further separation of different ESC types. Primed hESCs more closely identified with DE cell types, as indicated by PC1 (Fig. 6C).
To further define the relationship between the developmental stage in pluripotent culture to that derived upon differentiation, we focused on the capacity of another naïve human state to differentiate into endoderm. It has been suggested that naïve hESCs generated using the RSeT system (Gafni et al., 2013) represent a later stage of EPI development, albeit more naïve than conventional primed hESCs (Stirparo et al., 2018). As the early post-implantation EPI was recently shown to generate VE (Nowotschin et al., 2019), we predicted that RSeT cells would differ from primed cultures in their capacity to generate PrE, but might also generate DE. We therefore adapted H9 hESCs to RSeT conditions (naïveRSeT) (Fig. S7A) and assessed endoderm differentiation. After 7 days in RACL, naïveRSeT cells differentiated to a population that exhibited co-expression of the PrE marker PDGFRA and the DE marker CXCR4 (Fig. S7B). In addition, these cells downregulated NANOG and OCT4, and homogenously expressed GATA6 (Fig. S7C,D). Based on normalised expression of a panel of markers and flow cytometry, these cells exhibit a combination of primitive and definitive characteristics (Fig. S7B,E).
Although naïveRSeT cells appeared to generate a compound endodermal population, we considered whether a PrE cell type could be expanded, and how this expansion might impact on endodermal phenotypes. We placed naïveRSeT-derived PrE (PrERSeT) in NACL and these cultures expanded efficiently as putative nEnd, similar to nEndt2iLGö (Fig. S7F). Attempts to expand primed hESC-derived DE in NACL have thus far been unsuccessful, with the differentiated endoderm in these cultures being outcompeted by overgrowth of ESCs (Fig. S7G). To determine the nature of nEndRSeT, we performed whole-transcriptome analysis of these cultures and combined them with the transcriptomes described previously for unbiased hierarchical clustering of the different naïve, primed and differentiated states (Fig. 6D). Here, it is apparent that nEndRSeT clusters with nEndt2iLGö, suggesting that both naïve states can generate extra-embryonic endoderm upon expansion. This analysis also supports the notion that naïveRSeT hESCs either generate an intermediate endodermal cell type that initially clusters with DE/ADE, or a mixture of differentiated cell types that average out to this position (Fig. 6D). To distinguish between these possibilities, we differentiated naïvet2iLGö, naïveRSeT and primed ESCs in the presence of RACL. We assessed each ESC and endodermal cell type for pluripotency, and primitive and definitive endoderm markers using quantitative immunofluorescence (Fig. S7H,I). Endoderm derived from primed cultures contains a large number of brachyury (TBXT)- and FOXA2-positive cells, which are notably reduced in PrEt2iLGö cultures. In contrast, PrERSeT cells contain populations of both positive and negative cells for these DE markers. This suggests that differentiation of naïveRSeT cultures produces a mix of primitive and definitive endoderm that resolves to PrE when placed in NACL, as only the primitive population expands.
The similarities of these different cell states were also observed by PCA (Fig. 6E). Here, PC1 appears to resolve primed and naïve cell types. This would suggest that the t2iLGö/h2iL and RSeT naïve states constitute different steps in a potential developmental progression, with naïveRSeT hESCs representing an intermediate in a continuum from a potential pre-implantation t2iLGö/h2iL state to conventional primed pluripotent cells. This observation is also supported by the expression of established markers, with naïveRSeT expressing high levels of NANOG and DPPA3 (Fig. 6F). Consistent with the single-cell analysis shown in Fig. S7I, which indicates a heterogeneous endoderm population, the whole-transcriptome analysis shows expression of both PrE and DE markers, including PDGFRA and NID2, as well as GSC, CER1 and HHEX (Fig. 6F). Upon expansion in NACL, this heterogeneity is resolved with the PrE/nEnd population apparently selected for, and with the gene expression signature of nEndRSeT closely resembling nEndt2iLGö (Fig. 6D-F).
To define sets of markers unique to these populations of endoderm, we assessed the top 70 genes varying across all endodermal cultures and defined clusters of genes in which expression correlated (Fig. S8A). From these, we observed genes known to be upregulated during gastrulation, such as GSC, CER1 and LEFTY1, clustering to DE, ADE and RSeT-derived endoderm. The potential of NID2 as a specific marker for PrE is reinforced here, with WNT2, TBX4 and COL3A1 also appearing as unique markers for in vitro-differentiated or expanding nEnd.
As a means to verify the hypoblast nature of our in vitro-derived PrE, we compared our bulk transcriptomic data with that derived from single-cell sequencing of cynomolgus monkey embryos (Nakamura et al., 2016, 2017). We performed differential expression analysis of genes that define the transition between naïve pluripotency and PrE/nEnd, as well as naïve ESCs and DE, and equivalent stages of primate development (Fig. 6G, Fig. S8B-E). Based on common genes present in these comparisons, it appears that in vitro-derived PrE/nEnd closer resembles primate hypoblast, rather than later post-implantation stages of endoderm development. In addition, in vitro-derived DE more strongly correlates with a cluster of post-implantation gastrulating cells, Gast2a/b, than with the hypoblast. We applied the same comparison to published human single-cell datasets (Yan et al., 2013; Blakeley et al., 2015) and similarly found that our PrE/nEnd, but not DE, cultures correlate to the changes in gene expression observed in extra-embryonic hypoblast (Fig. S8F-H). Unfortunately, as it is not possible to obtain material for early human in vivo DE and additional extra-embryonic endoderm material, the comparisons that we are able to perform are inherently limited.
Using a reporter cell line marking endodermal types, we have demonstrated that the context-dependent response of pluripotent cells to Wnt and Nodal signalling previously demonstrated in mouse is conserved in human, and that these pathways could induce naïve hESCs to generate PrE and support its expansion. Based on this model for PrE differentiation, we provide a partial resolution to a controversy, showing that FGF signalling is essential for PrE specification in vitro, suggesting that the canonical role of this pathway defined in mouse is conserved. Moreover, consistent with our observations in mouse that FGF/ERK inhibition in ground state conditions supports self-renewal via a block of PrE differentiation (Hamilton and Brickman, 2014), we found that inhibiting this pathway at the level of the FGFR was sufficient to support naïve pluripotency alongside LIF and CHIR.
The overwhelming similarities at the transcriptional level between cells in t2iLGö and h2iL suggests that signalling between human and other vertebrate species, including mouse, is more conserved than has been previously thought. However, why is inhibition of MEK1/2 alone insufficient to support human naïve pluripotency? In PrE differentiation in vitro, we observed that the FGFR inhibitor PD17 was a more effective antagonist of endoderm differentiation than targeted MEK1/2 inhibition. This suggests the presence of a second pathway downstream of the FGFR, required for PrE differentiation, and that this pathway could exploit aPKC. Previous studies have shown that the activities of aPKC are necessary for mitogen signalling via the MAPK/ERK cascade (Berra et al., 1993, 1995). It has also been suggested that aPKCι/λ is recruited to modulate FGF signalling through the docking protein FRS2 (Lim et al., 1999). In this context, aPKCι/λ acts downstream of Ras, but upstream of ERK1/2 (Bjørkøy et al., 1997). As aPKCι/λ has been implicated in PrE specification in mouse development (Saiz et al., 2013), there may be a fundamental aPKCι/λ/FGF-dependent signalling loop involved in PrE induction and segregation from the EPI. In mouse, this loop appears to be MEK1/2-dependent, whereas in human this signalling loop has diverged slightly, so that MEK1/2 and aPKCι/λ represent parallel branch points. Thus, in human, simultaneous inhibition of aPKCι/λ and MEK1/2 or a block to the FGFR is required to block differentiation, support self-renewal and sustain naïve pluripotency.
A number of naïve ESC states have been reported and extensively characterised in human. Here, we have exploited the parallel differences seen in the transcriptomes from ESCs and ESC-derived endoderm that we initially described in mouse, to demonstrate that a similar trajectory is observed in human and that multiple naïve states lie at different positions along this continuum. Thus, naïveRSeT cells lie between naïvet2iLGö and naïveh2iL and conventional primed hESCs, suggesting that these cultures could represent a particularly early state of the post-implantation EPI. This intermediate naïve state appears to be unique in its capacity to generate both embryonic and extra-embryonic endoderm, with the PrE component exhibiting properties of nEnd when placed into expansion. Moreover, as the EPI in vivo generates VE, and the later EPI generates DE (Nowotschin et al., 2019), naïveRSeT may represent a model with differentiation competence for both lineages. In addition, unlike in mouse where only low levels of endodermal transcription is detected in naïve ESC culture (Canham et al., 2010), we and others have observed significant GATA6 protein expression in different human cultures (Chan et al., 2013; Guo et al., 2016). Specifically, in both steady state self-renewal and during early endoderm differentiation, GATA6/NANOG double-positive cells arise, suggesting that cR naïve ESC culture may be closer in identity to the unsegregated ICM. Taken together, these observations suggest that, in human, it appears possible to trap a broad range of putative pluripotent states in culture, capturing stages ranging from unsegregated ICM, through implantation, and eventually to later post-implantation, pre-gastrulation stages in primed culture.
Common to the pluripotent states captured in human ESC culture is the capacity to respond to the evolutionarily conserved Wnt and Nodal pathways (reviewed by Grapin-Botton and Constam, 2007). These pathways induce endoderm differentiation in both naïve and primed hESCs, but in a context-dependent manner. In the naïve ground state, these cells are driven towards the developmentally proximal PrE, whereas this signalling in primed cells mediates differentiation towards DE. The conservation of this level of developmental context in mammals serves to provide better coherence to the fundamental role of these pathways in endoderm induction observed across a range of vertebrate species. Thus, both embryonic and extra-embryonic endoderm lineages are produced as a result of the same conserved signalling cascades.
Extra-embryonic stem cell lines, including TS and XEN cells, have been derived in rodent species (Tanaka et al., 1998; Kunath et al., 2005; Wamaitha et al., 2015). Whereas XEN cells resemble the later parietal endoderm, we recently reported that when applied to mESCs, RACL induced the more developmentally plastic naïve PrE. These nEnd cultures did not show expression of markers exclusive to either parietal or visceral endoderm (Anderson et al., 2017), an observation repeated here in the human model. We find that these human hypoblast cell lines possess the same naïve character as mouse nEnd and therefore should be considered as human nEnd, rather than XEN cells. Although these cell lines are derived in vitro, and their exclusive extra-embryonic character is difficult to verify functionally, they represent an expandable culture system that approximates human extra-embryonic endoderm. As it was recently demonstrated that VE contributes to the mouse embryonic gut (Kwon et al., 2008; Chan et al., 2019; Nowotschin et al., 2019; Peng et al., 2019; Pijuan-Sala et al., 2019), the distinction between embryonic and extra-embryonic endoderm has become blurred. As such, human nEnd represents a culture system with potential for probing the differentiation competence of the hypoblast for human organ culture. Given the focus on extra-embryonic endoderm as a primordial state for the reprogramming of somatic cells to pluripotency (Parenti et al., 2016; Zhao et al., 2018), the existence of a closely related naïve endodermal culture system could serve as an important stepping stone in understanding mechanisms regulating the final stages of induced pluripotency.
MATERIALS AND METHODS
H9 (WA09, WiCell) and HUES4-170-3 (Ameri et al., 2017) hESCs were used for primed culture and subsequent chemical resetting. The cR-Shef6 cell line was a generous gift from the Smith lab (Takashima et al., 2014) and the embryo-derived HNES1 cell line was a generous gift from the Nichols lab (Guo et al., 2016). All lines were routinely screened for mycoplasma, and all were negative.
Conventional primed hESC culture
Primed hESCs were maintained on tissue culture plates pre-coated with 0.1% gelatine and 25×103 per cm2 irradiated MEF feeder cells in KSR (Thermo Fisher Scientific) medium supplemented with 10 ng/ml FGF2 (R&D Systems). Cells were passaged every 3-5 days with Accutase (00455556, Thermo Fisher Scientific), re-plated with 10 µM ROCKi (Y-27632, Stemcell Technologies) for 24 h and maintained in 20% O2, 5% CO2, 37°C.
Naïve cells culture
Chemically reset naïve human ESC lines were generated and cultured using the protocols as specified from Austin Smith's lab (Guo et al., 2017) or by RSeT™ Medium (Stemcell Technologies), a commercial medium based on NHSM formula (Gafni et al., 2013).
Naïve hESCs were maintained in N2B27 supplemented with either t2iLGö [1 µM CHIR (1386, Axon), 1 µM PD03 (PZ0162, Sigma-Aldrich), an activity equivalent of 10 ng/ml LIF (made in house) and 2 µM Gö (2285, Tocris)] or h2iL [1 µM CHIR, 1 µM PD17 (3044, Tocris) and 10 ng/ml LIF]. Additional aPKC inhibitors tested include CRT0066854 (2656, Axon), PKC412 (2992, Tocris) and ZIP (2549, Tocris) at varying concentrations. Cells were seeded on either plates pre-coated with 0.2% gelatine and 15×103 per cm2 MEFs or on 5 µg/ml LN511 (BioLamina) for feeder-free culture. All naïve cells were passaged by dissociation every 3-5 days with Accutase, re-plated with 10 µM ROCKi for 24 h and maintained under hypoxic conditions (5% O2, 5% CO2, 37°C).
Generation of hESC reporter lines
HHEX-mCherry and FOXA2-mVenus reporter cells were generated in parental H9 hESCs (H9-HF) by replacing the endogenous start codon of the HHEX gene with an H2B-mCherry-LoxP-SV40-neoR-LoxP cassette, and by replacing the endogenous stop codon of the FOXA2 gene with a GS-mVenus-LoxP-SV40-neoR-LoxP cassette, respectively (Fig. S1B,D). We first introduced the HHEX-mCherry reporter into H9 hESC via CRISPR nickase-mediated homologous recombination (with gRNA1: 5′-TTATCCGCGCTCCGCGCTCC-3′ and gRNA2: 5′-CGCGGATAAATGTAGCGCCG-3′). Then, the FOXA2-mVenus reporter was introduced to the targeted HHEX-mCherry reporter hESC via CRISPR HF nuclease-mediated homologous recombination (with gRNA: 5′-GAAGCCGTCGTCTTCTTAAG-3′). All cell lines were routinely checked for appropriate antibiotic resistance and the presence of the correct reporters. All lines generated were characterised by PCR and Southern blot (Fig. S1C,E).
For karyotyping, fibroblasts and H9-HF hESCs were treated for 45 min with KaryoMAX Colcemid (Life Technologies), harvested in freshly prepared fixative consisting of 25% acetic acid and 75% methanol and sent for G-band karyotyping (Cell Guidance Systems).
The G2M proliferation tracing hESC reporter was made by stably introducing an H2B-mCherry-F2A-mVenus-hGem(1-110) cassette through random integration (Fig. S1G). The reporter was confirmed by flow cytometry analysis, as shown by all cells marked by H2B-mCherry, with a subset of proliferating cells expressing mVenus-hGem(1-110) (Fig. S1H).
Differentiation to PrE and DE
To differentiate to PrE, naïve or primed hESCs were plated at 5×104 per cm2 onto feeders in RACL [RPMI 1640 medium with GlutaMAX (61870044, Gibco) with B27 minus insulin (A1895601, Gibco) supplemented with 100 ng/ml ActA, 3 µM CHIR and 10 ng/ml LIF] for 7-8 days (PrE) and 4-5 days (DE). The medium was changed every other day until cells reached confluency, at which point medium was changed every day.
When PrE cells reached confluency, they were passaged by dissociation using Accutase, detached from the plate using a cell scraper and collected in clusters using a stripette (Fig. S6G). These were re-plated on feeders at a ratio of 1:2 to 1:4 in NACL (N2B27 supplemented with 100 ng/ml ActA, 3 µM CHIR and 10 ng/ml LIF) every 4-7 days, supplemented with 10 μM ROCKi for the first 24 h. nEnd cultures were frozen 1:1 in 60% NACL supplemented with ROCKi, 30% foetal bovine serum (FBS; Gibco) and 10% dimethyl sulfoxide (DMSO).
Immunostaining and imaging
Cells were fixed with 4% paraformaldehyde for 10 min at 37°C, blocked and permeabilised with 2% donkey serum, 0.3% Triton X-100 and 0.1% bovine serum albumin in PBS for 1 h at room temperature. Primary antibodies (Table S5) were incubated with 3% FBS in PBS overnight at 4°C and secondary antibodies were added for 1 h at room temperature. Brightfield imaging of cells was performed using a Nikon microscope. Fluorescent imaging was carried out using either a Leica AF6000 fluorescent microscope or a Leica SP8 confocal microscope.
Cells were dissociated with Accutase and resuspended as single cells with 3% FBS in PBS. Antibodies (Table S5) were incubated for 45 min to 1 h at 4°C and cells were stained with DAPI to exclude dead cells. Cells were analysed using an LSRFortessa (BD Biosciences) and data analysed with the FACSDiva and FCS Express 6 software (BD Biosciences).
RNA preparation and quantitative RT-PCR
Total RNA was extracted using the RNeasy Mini Kit (Qiagen) and cDNA was synthesised using SuperScript III reverse transcriptase (Invitrogen, 11904018) and Random Hexamer primers (Invitrogen, N8080127). RT-qPCR was performed using the LightCycler 480 Instrument II (Roche Life Science) using the primers and probes listed in Table S6. The housekeeping genes ACTB and GAPDH were used to normalise expression.
We mixed 100 ng of total RNA with RNA standards (Agilent; 5188-5282) and labelled with the LowInput QuickAmp Labeling Kit One-Color (Agilent; 5190-2305) according to the manufacturer's instructions. For each sample, 600 ng of labelled RNA was hybridised to a Human Gene Expression 8×60K v2 (Agilent; G4851B). RNA quantification and assessment of integrity was performed using the total RNA nanochip (Agilent; 5067-1511) measured on an Agilent Bioanalyzer. Microarrays were hybridised for 17 h, washed according to the manufacturer's instructions and scanned on a Surescan G2600D scanner using Agilent Scan Control 22.214.171.124 software with default settings (GE 8×60k microarrays). The resulting single-channel TIFF images were processed using feature extraction software (Agilent). All microarray expression data were analysed using ExAtlas (Sharov et al., 2015).
RNA integrity was assessed using a Fragment Analyzer (Advanced Analytical Technologies) and ribosomal RNA was removed from the samples using the NEBNext Poly(A) mRNA Magnetic Isolation Module (New England Biolabs). Sequencing libraries were prepared from 250 ng of purified total mRNA using NEBNext Ultra II RNA Library Prep Kit from Illumina (New England Biolabs) according to the manufacturer's instructions. Single-read RNA-seq was performed using the NextSeq 500 (Illumina) with High Output Kit (Illumina, FC-404-2005). Fastq files were aligned to the hg38/GRCh38 genome using STAR v2.5.3a (Dobin et al., 2013). Transcript expression levels were estimated with the –quantMode GeneCounts option and GRCh38p10.v27 annotations. FastQC v0.11.7 was used for QC metrics, and multiqc v1.7 (Ewels et al., 2016) for reporting. Data analysis was then performed with R/Bioconductor (Gentleman et al., 2004). Normalisation was performed with DESeq2 (Love et al., 2014). All RNA-seq data were analysed using the Bioconductor package DESeq2 (Love et al., 2014). Human (GSE36552, GSE66507) and cynomolgus monkey (GSE74767) single-cell RNA-sequencing datasets were analysed using the R package ‘Seurat’ (Stuart et al., 2019). Differentially expressed genes for each cell stage were calculated using the ‘FindMarkers’ function against Hypoblast versus EPI (human) or Hypoblast versus ICM/Pre-EPI (monkey) stages. Log2 fold changes of shared differentially expressed genes between single-cell RNA-seq stages and bulk RNA-seq samples were compared using Pearson correlations.
Statistics and reproducibility
Statistical tests were performed using GraphPad Prism 8. Significance was defined as: not significant>0.05; *P<0.05, **P<0.01, ***P<0.001, ****P<0.0001. Unless otherwise stated, error bars indicate ±s.d., P values were determined by standard t-test and all experiments are representative of three experiments or were reproduced with three biological replicates (n=3).
We thank Austin Smith and Jennifer Nichols (University of Cambridge) for the cR-Shef6 and HNES1 cell lines; Henrik Semb for the HUES4-170-3 cell line; and Helen Neil, Magali Michaut and the DanStem Genomics Platform for technical expertise, support and the use of instruments. Data processing and analysis were performed using Computerome Supercomputer at DTU (www.computerome.dk). We thank the DanStem Flow Cytometry Platform, the DanStem Imaging Platform and the DanStem Stem Cell Culture Platform for training, technical expertise, support and the use of instruments. We also thank Anne Grapin-Botton, Henrik Semb and members of the Brickman lab for critical comments on this manuscript. The Novo Nordisk Foundation Center for Stem Cell Biology is supported by a Novo Nordisk Foundation grant number NNF17CC0027852.
Conceptualization: M.L.-A., Y.F.W., J.M.B.; Methodology: M.L.-A., Y.F.W., K.G.V.A., J.M.B.; Software: M.L.-A., J.A.R.H., R.S.M.; Validation: M.L.-A., Y.F.W.; Formal analysis: M.L.-A., Y.F.W., J.A.R.H., R.S.M.; Investigation: M.L.-A., Y.F.W.; Data curation: M.L.-A., J.A.R.H.; Writing - original draft: M.L.-A., J.M.B.; Writing - review & editing: M.L.-A., Y.F.W., J.A.R.H., R.S.M., J.M.B.; Visualization: M.L.-A.; Supervision: Y.F.W., J.M.B.; Project administration: J.M.B.; Funding acquisition: R.S.M., J.M.B.
This work was funded by HumEn under the European Union Seventh Framework Programme FP7/2007-2013 (HEALTH-F4-2013-602889) and by the Danmarks Frie Forskningsfond (DFF-6110-00009). R.S.M. is supported by a fellowship from the Lundbeckfonden (R303-2018-2939).
The authors declare no competing or financial interests.