Abstract
Generation of induced pluripotent stem cells (iPSCs) via the ectopic expression of reprogramming factors is a simple, advanced, yet often perplexing technology due to low efficiency, slow kinetics, and the use of numerous distinct systems for factor delivery. Scientists have used almost all available approaches for the delivery of reprogramming factors. Even the well-established retroviral vectors confuse some scientists due to different tropisms in use. The canonical virus-based reprogramming poses many problems, including insertional mutagenesis, residual expression and re-activation of reprogramming factors, uncontrolled silencing of transgenes, apoptosis, cell senescence, and strong immunogenicity. To eliminate or alleviate these problems, scientists have tried various other approaches for factor delivery and transgene removal. These include transient transfection, nonintegrating viral vectors, Cre-loxP excision of transgenes, excisable transposon, protein transduction, RNA transfection, microRNA transfection, RNA virion, RNA replicon, nonintegrating replicating episomal plasmids, minicircles, polycistron, and preintegration of inducible reprogramming factors. These alternative approaches have their own limitations. Even iPSCs generated with RNA approaches should be screened for possible transgene insertions mediated by active endogenous retroviruses in the human genome. Even experienced researchers may encounter difficulty in selecting and using these different technologies. This survey presents overviews of iPSC technologies with the intention to provide a quick yet comprehensive reference for both new and experienced reprogrammers.
Introduction
G

Systems and strategies for factor-mediated pluripotent reprogramming. IRES, internal ribosome entry site.
The first batch of mouse and human iPSC lines were generated using virus-based genome-integrating systems due to the need for lasting transgene expression required for succesful reprogramming. However, any insertion of foreign DNA into the reprogrammed genome will pose a risk of insertional mutagenesis. A second major concern with integrating systems is reactivation and residual expression of the integrated reprogramming factors. All of the reprogramming factors are tumorigenic to some extent with c-Myc as the most notorious oncogene [10]. Actually, c-Myc was found to be responsible for the tumors found in iPSC chimeric mice [3,11]. Therefore, integrating vectors are not the choice for reprogramming when safety becomes a concern. Reduction or complete elimination of transgene integrations has been one of the major goals for all of the improvements cited earlier. The inset of Fig. 1 depicts the strategies for such efforts, which include (i) use of a polycistron to reduce the number of integrations; (ii) use of the Cre-LoxP system to excise the transgenes from the reprogrammed genome; (iii) direct delivery of reprogramming RNA (synthetic mRNA, RNA virus, RNA replicon, or miRNA) to avoid integration of the transgene sequences; (iv) transposon transposition of transgenes into the reprogramming genomes, and subsequent excision of transgenes from the reprogrammed genomes; (v) repeated transfections of cells with nonreplicating plasmids; (vi) protein transduction; (vii) use of reprogramming-promoting small molecules; and (viii) use of nonintegrating and replicating episomal plasmids.
Given the fact that almost all systems of gene delivery have been employed in factor reprogramming, even experienced reprogrammers encounter difficulty and frustration in choosing which approaches to use for a specific reprogramming project. For example, the most widely used retroviral vectors include three different systems representing various tropisms: ecotropism, amphotropism, and pantropism. Which system fits a specific reprogramming experiment will depend on cell types to be reprogrammed, end use of the iPSCs, and the nature of a reprogramming project. Wise selection of reprogramming vehicles is the key to success, and it requires knowledge of the technologies of various factor delivery systems. Therefore, a survey of different iPSC reprogramming technologies is provided. A supporting companion review on vectorology and factor delivery systems used in factor reprogramming is available [12]. Chemical reprogramming is not covered here, because it does not involve a special technology other than a simple inclusion of reprogramming chemicals in the reprogramming medium. Several reviews on chemical reprogramming are available [13 –17].
Reprogramming with Retroviral Vectors
Retroviral vectors [12] are the most widely used vehicles for gene therapy and transgene delivery in cultured cells and animal models [18]. They also played a seminal role in the historical development of iPSC technology due to their ability to efficiently transduce murine cells for a lasting expression of transgenes. After screening 24 murine genes using ecotropic retroviral vectors, Yamanaka and Takahashi found that the quadruplet of Oct4 (also known as Pou5f1), Sox2, c-Myc, and Klf4 can convert murine somatic cells into embryonic stem cell (ESC)-like cells (iPSC) [1]. The early and many ensuing murine reprogramming experiments employed retroviral vectors [1,3,19 –21]. However, ecotropic retroviral vectors do not transduce human cells, yet human iPSCs were generated with an ecotropic retrosystem after the human fibroblasts were sensitized by a murine viral receptor, mCAT1, delivered by LV [19,22,23]. This extra step could have been avoided if the authors had used pantropic retroviral vectors or LV for the delivery of reprogramming genes (see below). A second pseudotype of retroviral vectors, amphotropic retroparticles, transduces human cells at a low efficiency (16% for fibroblasts) that is generally insufficient for reprogramming although some human cells, including human mesenchymal stem cells [24] and neonatal dermal fibroblasts [25], are reported to be reprogrammed by amphotropic vectors. However, amphotropic retroparticles appear to transduce human keratinocytes efficiently with two rounds of spinoculations, and reprogrammed human keratinocytes into iPSCs [26,27]. Although transduction efficiency of human CD133+ cells with amphotropic retroviral vectors is as low as 28%, CD133+ cells were still reprogrammed by three rounds of viral transductions [28] due to the high reprogrammability of hematopoietic stem cells [28,29]. Amphotropic retroviral vectors can reprogram nonhuman primate fibroblasts [30,31]. A third type of retroviral vectors, pantropic retroviral vectors pseudotyped with vesicular stomatitis virus G (VSV-G) protein, can transduce human cells more efficiently. VSV-G-pseudotyped retroviral vectors reprogrammed activated human T cells [32], human umbilical vein endothelial cells [33], human adipose stem cells (hASCs) [34,35], fibroblasts [36,37], human neural stem cells (hNSCs) [38], and diseased human fibroblasts [39]. Pantropic retroviral vectors also enable reprogramming of nonhuman primate cells [40,41]. Recently, human cells from urine samples were reprogrammed surprisingly with ecotropic retroviral vectors [42,43]. This success indicates that ecotropic retroviral vectors may transduce some types of human cells. The efficiency of many human reprogramming experiments using amphotropic or ecotropic vectors could have been significanly increased if the authors had used pantropic retroviral vectors. Retroviral vectors transduce only dividing cells, and, therefore, nondividing cells and slow-growing cells are more difficult to be reprogrammed with retroviral vectors. For example, due to low transduction efficiency with retroviral vectors, researchers used preintegrated reprogramming factors to reprogram mouse B cells, for which the starting B cells were isolated from transgenic mice that harbored inducible reprogramming factors [44]. Such a requirement of cell division for transduction may explain the frequent use of repeated transductions in many retroviral reprogramming protocols. Since retroviral vectors transduce only cells that divide shortly after infection, consecutive transductions increase the population of infected cells. Reprogramming with retroviral vectors might be compromised by premature silencing of reprogramming factors [45], by de novo methytransferase Dnmt3a and Dnmt3b [46], and by the retroviral silencing factor TRIM28 [45,47]. Retroviral vector silencing starts as early as day 4 of reprogramming [45]. Viral integration is not required for successul reprogramming, as many nonintegration approaches have proved to be capable of reprogramming [9,31,48 –53]. However, viral transduction per se surprisingly promotes reprogramming due to favorable epigenetic changes provoked by viral activation of the TLR3 innate immune response pathway [54]. The titers of ecotropic and amphotropic vectors decrease dramatically after concentration and/or freezing, and, therefore, fresh preparations of retroviral vectors are usually used in retroviral reprogramming protocols [55]. This is less convenient than the standard LV, which can be concentrated and frozen without significant loss of titer. Nevertheless, pantropic RV pseudotyped with VSV-G can be concentrated and frozen for later use in reprogramming [56]. In addition to many other drawbacks of retroviral reprogramming (insertional mutagenesis, transgene reactivation, and residual expression), retroviral vector-generated iPSCs are much more immunogenic than those induced with nonviral vectors [4], which compromises the value of such cells in clinical applications.
Reprogramming with LV
LV [12] are another widely used vehicle for the delivery of transgenes in reprogramming. Due to the broadened tropism with VSV-G pseudotyping, LV are ubiquitously used for reprogramming of human cells [2,57 –59], murine cells [45,58,60], rat cells [61], and pig cells [62]. One advantage of LV is their abilities to transduce nondividing cells. Due to this trait, LV were used to successfully reprogram murine terminal β cells, which divide only one time in culture [60]. Although LV tranduce nondividing cells, repeated transductions were still employed for reprogramming of mouse and human melanocytes [58]. LV are reported to be more efficient in reprogramming than retroviral vectors because of slower silencing of reprogramming factors [45]. Inducible LV are useful tools in the study of reprogramming [44,45,58,63 –65]. The use of inducible LV enables secondary reprogramming to be performed with cells differentiated from iPSCs previously generated with such vectors [44,58]. Such secondary cells avoid inefficient viral transduction of multiple reprogramming factors, and they are more homogeneous in copy number and insertion position of reprogramming factors [64]. Therefore, inducible LV furnish a valuable tool for the study of reprogramming mechanisms owing to the homogeneity and controlled expression of transgenes [45,64]. For example, inducible LV allow for the demonstration that the reprogramming of mouse fibroblasts requires at least 8 days of expression of exogenous reprogramming factors, and that transgene expression is still required for a period of time beyond the point at which Oct4 or Sox2 is activated [45]. Different reprogramming genes have a great impact on viral titer. For example, in our experience, LV encoding human KLF4 or LIN28 give titers of 2–10 times or 10–50 times lower, respectively, than those of LV encoding the OCT4 and c-MYC reprograming factors (our unpublished observations).
Reprogramming by Transient Transfection
Although transient transfection was not the choice of methods in the pioneering screening reprogramming, it turns out that transient transfection can generate both mouse and human iPSCs, albeit at a very low efficiency [51,66,67]. Yamanaka's group reported the successful generation of mouse iPSCs free of transgene sequences from mouse embryonic fibroblast (MEF) using conventional plasmids [51]. However, integrations were found in some of these iPSC lines; they had to transfect the cells at least four times and the reprogramming efficiency was at least one order lower than that of the virus-based methods. Interestingly, mouse iPSCs were generated by just one or two transfections of transient plasmids when nucleofection was employed. Nucleofection delivers plasmids directly into nuclei, and it gives a strong and early expression of transgenes. However, the majority of the iPSC lines generated by nucleofection harbored random integrations [66]. It is more difficult to generate human iPSCs with transient tranfections. In combination with valproic acid (VPA), Si-Tayeb et al. generated a single human iPSC line with regular plasmids [67]. Another limitation with reprogramming by transient transfection is that most types of normal cells are difficult to transfect with current transfection technologies [68]. For the generation of patient-specific iPSC lines, primary cells should be used as the starting cells. Electroporation can be used to transfect the hard-to-transfect primary cells, but it is not practical to use electroporation if multiple rounds of transfections are required, because electroporation requires detachment of cells (adherent cells). In addition, electroporation usually results in significant cell death. As mentioned earlier, electroporation also promotes transgene integration and this voids the point of using this approach to eliminate integration. A new class of biodegradable, low toxic transfection reagents, poly-β-amino esters, has enhanced transfection efficiency for human cells [69], and facilitates the generation of human iPSCs with transient plasmids [70]. However, as with nucleofection, the enhanced reprogramming may be attributed to the increased integration of the reprogramming factors, as all tested iPSC lines harbored integrated copies of transgenes.
Another factor impairing reprogramming with transient plasmids is the rapid silencing of transgenes because of the bacterial backbone sequences. Minicircles containing a eukaryotic expression cassette without a bacterial backbone provide longer and stronger expression. Such a system is reported to reprogram hASCs into iPSCs, but at a very low efficiency of 0.005% even with three consecutive transfections due to the transient nature of minicircle DNA [71,72]. Literally, six canonical reprogramming factors were acting in their reprogramming, as hASCs express higher levels of KLF4 and c-MYC than ES cells, and the authors used the four reprogramming factors OCT4, SOX2, NANOG, and LIN28. The same set of reprogramming factors in minicircles has not been sucessful in reprogramming human adult dermal fibroblasts although neonatal fibroblasts (IMR90) were reprogrammed into iPSCs at an efficiency 10 times lower than hASC. It is not clear whether a minicircle is really more efficient in reprogramming than regular plasmids, as no side-by-side comparison has ever been done.
Protein Reprogramming
Any DNA-based reprogramming approach will present the possibility of random integration. The chance of integration increases for reprogramming with transient plasmids because of repeated transfections. Nucleofection appears to promote the integration of reprogramming plasmids [51,66]. One ideal method for the generation of transgene-free iPSCs is, therefore, protein transduction [12]. Zhou et al. reported the generation of mouse iPSCs using four rounds of protein transduction of the reprogramming factors tagged with poly-arginine in the presence of VPA [48]. Their reprogramming proteins were produced in Escherichia coli. Protein purification and refolding are, therefore, necessary. Human iPSC lines have also been established with protein transductions without VPA [73]. In contrast to producing the mouse reprogramming proteins in E. coli, the human reprogramming employed reprogramming proteins produced in HEK293 cells. Reprogramming proteins produced in mammalian cells might be superior to those from bacteria, as bacteria lack eukaryotic posttranslational modifications. In addition, the bacterial recombinant proteins may have been mis-folded during the artificial renaturation process, and this might explain why proteins from HEK293 do not require VPA for complete reprogramming. However, this method is extremely inefficient (0.001%), and slow (8 weeks). This low efficacy might partially be a result of the fact that the investigators used whole cell lysates from the expressing cells; purification of the reprogramming factors might improve the efficiency. It was recently shown that activation of the TLR3 pathway due to the innate immune response to viral infection benefits reprogramming, and, indeed, the efficiency of protein reprogramming was increasesed by treatment of the reprogramming cells with the TLR3 agonist poly(I:C) [54]. Interestingly, mouse iPSC lines were successfully established from primary fibroblasts with only one single treatment with cell-free protein extracts from mouse ESCs [74]; while previous reprogramming of mouse NIH/3T3 fibroblasts and human 293T cells with mouse ESC extracts and extracts of human pluripotent carcinoma cells, respectively, generated only partially reprogrammed pluripotent cells [75,76]. Since the acting factors in the reprogramming with pluripotent cell extracts are native proteins without any protein transduction domain, these experiments had to use streptolysin O to reversibly permeate the cell membrane for protein entrance [77].
Reprogramming with RNA Viral Vectors
Sendai virus (SeV) is a nonsegmented negative-strand RNA virus that has an exclusively cytoplasmic replication cycle without any DNA phase. To establish transgene-free iPSCs, cytoplasmic SeV-based vectors (SeVV) [12] have been used to reprogram mouse and human cells into iPSCs [52,78 –82]. F-deficient SeV vector (SeVV/ΔF) can efficiently reprogram human fibroblasts [78] and human activated T cells from peripheral blood [52,79]. A portion of the reprogrammed cells lost all viral genomes due to dilution over cell proliferation, and transgene-free iPSCs were subsequently established by removing virus-containing cells through negative selection by treatment of the reprogrammed cells with an antibody against the viral surface protein HN residing on the surface of the reprogrammed cells. The clearance of viral genomes from the established iPSCs can be accelerated by using temperature-sensitive SeVV rendered by point mutations in the viral polymerase genes P and L [81]. iPSCs generated using ts-SeVV lose viral genomes faster than the conventional SeVV/ΔF, and a short temperature shift can virtually remove all viral genomes from iPSCs. The earlier methods use multiple vectors for delivery of the four reprogramming factors, and coordinated expression is compromised due to homologous viral interference when coinfected [80,83]. An SeVV/ΔMΔFΔHN vector was, therefore, developed to deliver all four reprogramming factors in a single vector, in which all structure genes are deleted from the vector and replaced with the four reprogramming factors. Reprogramming with single SeVV/ΔMΔFΔHN was more efficient than with multiple SeVVs, and it might be the most efficient system as it is 100 times more efficient than the standard retroviral reprogramming [80]. Such an SeVV/ΔMΔFΔHN devoid of all structure genes also has the additional merits of enhanced safety and reduced immune response to the infected cells. Without the HN surface protein on the established iPSCs, virus genomes can still be efficiently removed by transfection of the generated iPSCs with siRNAs against the L gene, which encodes the enzyme essential for viral genome replication [80]. Genes delivered by SeVVs cannot be silenced by epigenetic modification, as there is no DNA phase. While this is an advantage in terms of persistent gene expression, it is not sure whether this will be a problem after the endogenous pluripotent genes are activated considering that the expression levels of pluripotent genes should be strictly controlled. Although SeV is not pathogenic to human, it naturally infects the airway epithelium [84], and, therefore, extreme care should be exercised in handing packaged virions when oncogenes such as c-Myc and Klf4 are installed in this vector. SeV is strongly fusogenic [85]. It is not known whether any tetraploid iPSCs have ever been generated in previous research due to cell fusion during reprogramming.
Reprogramming with Synthetic mRNA and Synthetic RNA Replicon
A second RNA strategy for the generation of iPSCs free of transgene integration is to deliver synthetic mRNA encoding reprogramming factors directly into the cells to be reprogrammed. Indeed, Warren et al. [50] converted the human ESC-derived fibroblasts into iPSCs using synthetic mRNA of the Yamanaka factors [OCT4, SOX2, KLF4, and c-MYC (OSKM)]. The addition of LIN28 mRNA to the Yamanaka combination (OSKML) can reprogram more refractory human fibroblasts (D551, MRC-5, BJ, and skin biopsy from an adult cystic fibrosis patient). Due to the transient nature of RNA transfection, daily transfection for appoximately 17 days is required in their protocol. Foreign single-stranded RNA induces innate immune response from the transfected cells through the interferon/NFκB-dependent pathway [50,86]. To make reprogramming successful, modifications of mRNAs are, therefore, necessary to reduce the innate immune response. In addition to removal of the immunogenic 5′ triphosphate from the residual uncapped mRNAs by phosphatase treatment, the reprogramming mRNAs should be modified by both 5-methyl-cytosine and pseudouridine for efficient reprogramming. Medium supplementation with interferon inhibitors is also advantageous. Interestingly, the iPSCs generated by mRNA reprogramming were more closely related to ESCs than those reprogrammed using viral vectors, and the authors suggested that the lack of residual expression of the reprogramming factors in mRNA reprogramming may contribute to this difference. The same mRNA cocktail can also reprogram human cells in a feeder-free system [87]. The key to successful reprogramming with synthetic mRNAs is to avoid the cell death due to a strong innate immune response to long synthetic mRNAs, and this explains why several other groups achieved only partial reprogramming with synthetic unmodified reprogramming mRNAs [86,88,89].
mRNA reprogramming requires repeated tranfections for approximately 17 times, which needs careful optimization of the transfection reagent, media, and procedure. Repeated tranfections exclude the use of electroporation. The need for multiple transfections also limits the use of mRNA reprogramming in hard-to-transfect cells such as blood cells, which are one of the most convenient sources of cells for reprogramming [29,53]. To overcome this, Dowdy's group recently used synthetic RNA replicon to deliver reprogramming factors into human reprogramming cells [90]. Their RNA replicon is based on the Venezuelan equine encephalitis (VEE) virus. VEE virus belongs to alphaviruses that are enveloped, single-stranded, postive-sense RNA viruses without any DNA phase in their life. An alphavirus replicates exclusively in the cytoplasm using the virus-encoded RNA-dependent RNA polymerase (RdRP). A noninfectious, but self-replicating alphaviral vector (RNA replicon) can be constructed by replacing the structure genes with genes of interest. RNA replicons can be delivered into cells in the form of packaged virions or as synthetic RNA genomes (RNA replicons). The latter can be achieved by in vitro transcription using SP6 or T7 promoter systems. On delivery into cells, a synthetic alphaviral RNA genome can self-replicate, as it retains RdRP. RNA replicons need only one transfection for successful reprogramming. However, alphavirus is extremely cytopathic [91,92], and reprogramming cannot be achieved without the suppression of interferon with the B18R protein. A caveat to the use of RNA replicons (SeVV and alphaviral vectors) is that RNA or RNA viruses can be converted into cDNA by possible reverse transcriptases in the human genome, and the resulting cDNA can still integrate [93,94], as demonstrated by the prevailing presence of processed pseudogenes in the human genome [95], and recent experimental evidence for the genomic integration of nonretroviral cDNAs [96]. Such reverse transcriptase activity can emanate from the active members of the human endogenous retroviruses, HERV-K [97,98]. Therefore, even iPSC lines generated by RNA approaches should be carefully examined for possible integrations.
Episomal Reprogramming
Reprogramming with conventional plasmids requires multiple rounds of transfections because of their incapability to replicate in mamallian cells. To overcome this, Yu et al. used Epstein Barr virus (EBV)-based self-replicating episomal vectors to successfully reprogram human fibroblasts [49]. EBV episomal plasmids need only one transfection for successful reprogramming due to their ability to replicate and partition in mammalian cells. However, the efficiency is so low that the four canonical reprogramming factors are not sufficient to drive reprogramming to completion, and a total of five to seven factors are required for the generation of iPSCs [31,49,99 –102]. The efficiency of episomal reprogramming can be increased by using blood progenitors as starting cells, but additional factors are still used [29,31,53,100]. The reprogramming efficiency can also be greatly increased using a xenofree culture system developed recently, but seven reprogramming factors were still used for reprogramming human fibroblasts [99]. Using a similar EBV episomal system, OCT4 and NANOG are sufficient to reprogram the hNSC line, ReNcell-VM [103]. This is not surprising, as neural stem cells represent the most reprogrammable cells [103 –105] possibly due to their expression of the reprogramming factors SOX2, MYC, and KLF4 [38,104,105]. Integration-free iPSC lines can be established owing to the instability of EBV plasmids in primate cells [106,107]. The episomal iPSCs may have higher quality, as such iPSCs displayed much lower immunogenicity than those generated by viral vectors [4]. The relatively lower efficiency of episomal reprogramming seems intrinsic to the vector itself given the following three observations. First, transgenes in the EBV vector are silenced quickly by human cells due to quick and extensive methylation of the vector sequence [107,108]. Second, although EBV plasmids can replicate efficiently, the establishment of a stable state is an infrequent event, and 30% of the transfected cells lose plasmids with each cell division cycle during the first 2 weeks post-transfection [107]. Third, the “established” EBV plasmids in human cells are still not stable on withdrawal of the selection pressure, and are reported to be lost in 2% to 5% of the cells in each cell division cycle [106,107]. The expression level of Epstein-Barr virus nuclear antigen 1 (EBNA1) in current constructs seems to be a limiting factor, as reprogramming efficiency can be enhanced by either inclusion of a separate EBNA1-expressing plasmid [109], or introduction of the EBNA1-coding mRNA [99]. Recently, it has been reported that an improved EBV vector can reprogram human cord blood CD34+ cells into iPSCs with only two factors (OCT4 and SOX2) [110]. The EBV plasmids cannot replicate in murine cells. Fortunately, polyoma-based episomal vectors can be stably maintained in mouse cells [111,112], and such vectors were used to reprogram murine cells into iPSCs [4].
Polycistronic Reprogramming
To achieve complete reprogramming, four factors are required. Early reprogramming constructs contained a single individual reprogramming gene on each plasmid, and, therefore, at least four integrations were necessary for successful reprogramming due to the use of four separate vectors [1,2]. In fact, it was found that each reprogramming factor had multiple integrations [3,19,20] with as many as 8 copies per reprogrammed genome for a single reprogramming factor [3]. To reduce the number of integrations, single-vector reprogramming systems were developed [66,113 –115]. In such systems, the four genes were linked by 2A sequences. This single-vector system expresses the four reprogramming factors as a single transcript, but results in four individual proteins through mechanisms of self-cleavage or ribosomal skip mediated by the 2A peptides [116,117]. The single-vector approach can reduce the number of integrations to one, but integration still exists.
A second advantage by installing all of the four reprogramming genes on one vector is that every infected cell receives a full set of reprogramming genes. It is known that an increased number of different types of viral particles in a mixed viral sample (different genes) decreases the co-infection of an individual cell [118]. However, placing four genes in one vector significantly compromises the packaging efficiency of viral particles due to the increased size of the viral genome. Internal ribosome entry site was also used for co-expression of reprogramming factors in the generation of human iPSCs in combination with multiple promoters [49].
Excisable Reprogramming Constructs
The Cre-Lox system was used to excise integrated transgenes from the reprogrammed genomes. Kaji et al. [119] constructed a single plasmid containing the four reprogramming factors linked by 2A sequences. The polycistron is flanked by Lox sequences. Linearization of the plasmids promotes the integration of the reprogramming cassette into the mouse genome, and therefore permits the reprogramming with a single transfection. The resulting iPSCs were then transfected with Cre recombinase to excise the transgenes. The overall reprogramming efficiency is actually low. This method is not yet demonstrated for the reprogramming of human cells. This approach can remove the risk of residual expression and re-activation of reprogramming factors, but the risk of insertional mutagenesis remains, because any sequence beyond the Lox sequences cannot be excised out. Such insertional mutagenesis by the residual inserted sequence can be reduced or eliminated by selecting a reprogrammed genome that harbors a single integration in an intron of a gene [120]. The same excisable system can be incorperated into LV [113].
A more advanced excision-after-reprogramming approach is the use of PiggyBac (PB) transposon. The PB vector was successfully used to reprogram MEF, and, subsequently, the introduced exogenous sequences were completely excised seamlessly by transient tranfection of the resulting iPSCs with the PB transposase to successfully generate several mouse iPSC lines without any genomic alteration of the reprogrammed genomes [121]. Four individual donor plasmids appear to give rise to too many integrations in the reprogrammed genomes (an average of nine integrations per reprogrammed genome), and the subsequent excision becomes a daunting task due to the low efficiency of transposon excision, and secondary transposition. To facilitate transposon removal, the authors then designed a single-plasmid donor vector containing the four reprogramming genes linked by 2A so as to reduce the number of transposon integrations. The authors actually obtained two mouse iPSC lines with a single integration per line out of 48 iPSC lines, and the integrated sequences were cleanly excised at a frequency of 2% by transient expression of transposase. The authors also reported the generation of human iPSCs by PB transposition, but no human transgene-free line was mentioned in the report. The excision process can be facilitated by reducing the amount of transposon plasmids, but this can decrease reprogramming efficiency too as at least two sets of integrated reprogramming factors are needed for efficient reprogramming [122]. The screening process can be simplified by inclusion of the negative selection gene, HSV-tk, in the transposon cassette [122]. PiggyBac systems provide an avenue to generate iPSC lines without genome alteration. It is also simpler and less immunogenic than the viral systems [123]. There are several concerns though. First, it is still an integration-based method, and therefore requires an extra excision step. Second, in the excision process, the transposase not only excises the transposon, but also transposes transposons into other locations of the genome. This secondary transposition complicates the excision step. Third, due to efficient integration and inefficient excision of this system, an extra screening step for iPSC lines with only a single integration using Southern hybridization is highly preferred before the excision process. It is reported that single integration is suboptimal for reprogramming and represents the rare populations of reprogrammed cells, and thus Yusa et al. had to use iPSC lines with two integrations for the subsequent transposon excision [122]. Fourth, although the transposase cuts the transposons precisely, imperfect excisions occur. Fifth, due to the incomplete removal, imperfect excision (microdeletion and small duplication), and re-integration of the excised copy, a high-resolution screening after the excision is required to ensure the integrity of the reprogrammed genomes. Sixth, during transposition, the transposase cuts the donor plasmid into two parts, and the linearized nontransposon half might have a higher chance to undergo random integrations. The current protocols ignore the possible integrations of nontransposon elements of the donor plasmids. Finally, the human genome harbors many PB-like transposons [124,125] and this raises two concerns. First, it is not known whether these PB-like human “transposases” transpose the introduced PB transposons into any cryptic locations that can no longer be excised by the “canonical” transposase. The second concern is the possible mobilization of human PB-like elements by the introduced active transposase. While the human PB-transposases might be degenerate and inactive, the latter possibility could be highly real given that the cis-elements of PiggyBac transposon are very short. Evidence showed that terminal repeats only are sufficient in transposition although the additional sequences increase the transposition efficiency [126,127]. This low stringency might increase the chance for some homologous sequences in the human genome to be targets of the introduced transposase given that there are around 2,000 PB-like elements dispersed in the human genome [125]. One last particular concern is that the transposases should be used twice (one for reprogramming transfection, and the other for excision). Therefore, double uses of transposase plasmids also increase the risk of integrations of the transposase plasmids. Unfortunately, current screening protocols ignore the possible integrations of this helper plasmid.
Secondary Reprogramming with Preintegrated Inducible Reprogramming Factors
Reprogramming is a slow, inefficient, stochastic process with a long latency. Little is known about this complex process, and it is difficult to study due to its complex nature. Inefficient cell transfection with multiple reprogramming factors is a limiting factor in the study of the reprogramming process. To develop controllable and efficient reprogramming approaches for the study of the reprogramming process, iPSCs were generated using inducible lentiviral or PiggyBac constructs [128,129]. Such iPSCs were then used to generate transgenic mice. Somatic cells harboring the four inducible reprogramming factors were then isolated from these mice for the study of the reprogramming process. Unlike the cells from de novo transduction, these transgenic cells are genetically homogeneous in terms of location and number of transgene integrations. Many types of cells isolated from such iPSC-derived mice can be efficiently reprogrammed into secondary iPSCs by doxycycline induction, and therefore prove that this system is helpful in the study of reprogramming kinetics, drug screening, and screening for additional reprogramming factors and for reprogramming roadblocks [129]. This strategy helped identify C/EBP as a promoter of, and Pax5 as an inhibitor of reprogramming the hard-to-transfect terminally differentiated B cells [44]. Using the same system, Hanna et al. established that reprogramming is a stochastic process, and disproved the elite model of reprogramming [63]. A similar system was also used in human cells [64,130]. Human fibroblasts and keratinocytes have been reprogrammed into iPSCs using inducible lentiviral reprogramming constructs. The resulting human iPSCs were then differentiated into secondary fibroblasts, and theses iPSC-derived fibroblasts were used for secondary reprogramming. Kinetics of human reprogramming was examined using such secondary inducible reprogramming by timed withdrawal of the inducer (doxycycline) [64,130]. A limitation of this system is that some preintegrated factors become noninducible due to silencing (position effect) [130]. Cells from different reprogrammable mouse strains also vary in reprogrammability due to varied integration numbers of reprogramming cassette introduced randomly by LV or PiggyBac transposon [129]. This variability might be overcome by targeting a single inducible reprogramming cassette to a defined locus, such as Col1a1, using Flp recombinase [131].
Adenoviral Reprogramming
Stadtfeld et al. reported the generation of transgene-free mouse iPSCs from hepatocytes using adenoviral vectors [132]. However, they failed to produce iPSCs from mouse fibroblasts with this system. This might be because hepatocytes need lower expression of the reprogramming factors for the generation of iPSCs than fibroblasts, and because adenoviral vectors transduce hepatocytes more efficiently [133]. Even with hepatocytes, reprogramming efficiency is very low compared with retrovirus-mediated reprogramming. This is reflected by the fact that Yamanaka's group failed to generate mouse iPSCs from hepatocytes with adenoviral vectors [51]. The low reprogramming efficiency with adenoviral vectors may be attributed to transgene dilution over time due to cell division. Human iPSC lines were established later from fibroblasts using adenoviral vectors, but extremely high viral titers had to be used (>200 pfu/cell for adenovirus vs 5 cfu/cell for integrating lentiviral or retroviral vectors) [134]. One caveat with adenoviral reprogramming is that adenoviral vectors have an extremely high level of integration compared with naked plasmid DNA [135] even though adenoviral vectors are generally considered nonintegrating vehicles. At much lower multiplicity of infection/transduction (MOI, 10 pfu per cell), they have an integration efficiency of 10−3 to 10−5 per cell, and can reach approximately 1% for some types of cells [135]. As mentioned earlier, reprogramming with adenoviral vectors requires a much higher MOI, and this implies an increased chance of transgene integration. Furthermore, integrated adenoviral vectors undergo rearrangement [135], and this makes it difficult to identify iPSC lines free of foreign sequences.
Reprogramming with miRNA Mimics
miRNAs are noncoding, single-stranded, short regulatory RNAs with around 22 nts. Their main function is the suppression of gene expression by either breakdown of target mRNAs or translation supression [136,137]. It is now well known that miRNAs play major roles in reprogramming [138,139]. Knockdown of Ago2, Drosha, or Dicer, components for miRNA biogenesis, severely impaired reprogramming [140]. Some miRNAs enriched in or specific to ESCs were demonstrated to promote reprogramming in either the form of synthetic mature miRNA or virus-delivered transgenes [140 –143]. Unexpectedly, miRNAs can reprogram both human and mouse cells into pluripotent cells without the help of any other reprogramming factors [144 –147]. Therefore, reprogramming with synthetic miRNAs may represent an ideal modality for the generation of human iPSCs for clinical application considering several advantages of miRNA mimics [148]. First, they have no risk of genome integration. Second, miRNAs are small, and can be readily synthesized and delivered into cells. In addition, on incorporation into ribonucleoprotein complexes, they remain stable for several days, and, therefore, they function longer in transfected cells than the coding RNAs, which is a basic requirement for the reprogramming process. However, transfection of the reprogramming cells with miRNA mimics still provides only a transient action, and multiple transfections are required for complete reprogramming. Finally, feasibility of miRNA-mimic reprogramming awaits confirmation and/or improvement from more laboratories.
Concluding Remarks
Factor reprogramming to pluripotency is an inefficient, slow, stochastic process requiring the overexpression of multiple genes. Canonical and efficient reprogramming is mediated by integrating viral vectors. Retroviral reprogramming is still a prefered method in research reprogramming, but vector tropisms should be considered. HIV1-based, VSV-G-pseudotyped LV should be a better choice than retroviral vectors due to their broadened tropism, ability to transduce nondividing cells, ease of handling, reprogramming efficiency, and availability of inducible systems. One major goal of reprogramming research since the advent of iPSC technology is to eliminate or reduce transgene integrations so as to avoid insertional mutagenesis, and residual expression and re-activation of reprogramming factors. To achieve this, many alternative approaches have been introduced. One common limitation for most of these improvements is the lower efficiency of reprogramming compared with the founding LV and retroviral vectors. SeVV might be the only method that has a higher reprogramming efficiency, while factor integrations are eliminated. The EBV-based episomal system is the simplest nonintegrating approach with reasonable efficiency, because it needs only one transfection with standard Maxiprep DNA. EBV-episomal reprogramming might be a good choice for clinical reprogramming if immunogenicity of SeVV reprogramming is considered. iPSCs generated with RNA virus/replicon still has the possibility of transgene insertion given the existence of active endogenous retroviruses in human genome. With this said, reprogramming with protein, chemicals, and miRNA mimics might be the most reliable approach to avoid genome alterations. One focus of future studies will be the enhancement of reprogramming efficiency. The four canonical reprogramming factors play different roles and function at different time points of reprogramming, and sequential introductions of the reprogramming factors are reported to be beneficial to reprogramming [149]. The stoichiometry of reprogramming factors affects both reprogramming efficiency and iPSC quality [150,151]. Protocol optimization should consider these facts.
Glossary
Spinoculation: also known as centrifugal infection, a viral transduction/infection process that resorts to centrifugation to enhance deposition of viral particles onto cell surface for increased transduction/infection.
Pseudotype: pseudotype is a virus in which its original native envelope protein is replaced with that of another virus. Pseudotyping is used to alter the tropism (usually expand) of viral vector and enhance the stability of vector particles.
Nucleofection: a proprietary technology for cell transfection through electric shock and cell-type specific buffers, which can efficiently deliver nucleic acid directly into the nucleus for quick and strong expression.
Footnotes
Acknowledgments
Author Disclosure Statement
The author declares no competing financial interests.
