Materials and methods
Analysis of collagen family expression and function
Transcriptomic and proteomic analysis of human tissue cells
Healthy human tissue RNA-seq and single cell RNA-seq was downloaded from The Human Protein Atlas, Tissue-specific Gene Expression and Regulation, Genotype-Tissue Expression Project, Illumina Body Map, FAMTON5, ENCODE and The European Bioinformatics Institute and NCBI GEO database (Table S1). RNA-seq data is processed according to the method from previous report (Zoabi and Shomron, 2021). RNA and protein expression atlas was analyzed and demonstrated by R studio.
Tertiary structure prediction by AlphaFold2 and amino acid mutation analysis
Protein tertiary structure was predicted using AlphaFold2 (Bkunyun Platform, https://www.bkunyun.com/index.html?source=360E8&other=). Those structures with the highest score were picked for the next analysis, and the relevant PDB files were compared by UCSF Chimera. The identified DNA mutation sites were verified by sanger sequencing and corresponding amino acid mutants were labelled within the structure. The re-constructed structure was also predicted by AlphaFold2 and the relevant amino acid side chain changes were exhibited by UCSF Chimera. Disorder regions and conversation were both predicted by IUPred3.
Physiological and pathological analysis
Disease-related point mutation data derived from NCBI ClinVar database were calculated and plotted by R studio. Patient prognosis data are derived from The Cancer Genome Atlas database, Gene Expression Profiling Interactive Analysis and PrognoScan database. Collagen interact proteins data were downloaded from The STRING database, and the GO analysis was performed using the package of R studio. CRISPR screen data of collagen was downloaded and analyzed from the BioGRID database. Databases were showed in Table S1.
Strains and plasmids
All strains and plasmids were showed in Table S4. DNA of T7 RNAP, pIII-N-tripeptide-Mxe GyrA intein-N, ELN, CO3A1, Mxe GyrA intein-C-pIII-C, pT7-CO3A1-GyrA-6his-pIII, GyrA-tripeptide, T7 terminator, pTrc-Kana (Q5X, L23X), LE-pTrc-T7 RNAP-terminor-RE, LE-pTrc-mf-lon-terminator-pT7-LacO-GFP-pdt-terminator-RE, LE-pBAD-cisA-RepΔ33-T5 DNAP(D164A, E166A, A593R)-RE, pTac-cisA initial site-moumou_P4Hc_cisA terminal site were synthesized from the IGE Biotechnology Co., Ltd (Guangzhou, China) after codon optimization of E. coli in coding region. Mxe GyrA intein-C-pIII-C, pT7-CO3A1-GyrA-6his-pIII, pT7-CO3A1-GyrA-6his-pIII and pT7-terminator-CO3A1-GyrA-6his-pIII DNA was inserted to pJC175e plasmid to replace PSPO-gIII gene. RE-pTrc-T7 RNAP-terminor-LE, RE-pTrc-T7 RNAP*-terminor-LE, RE-pTrc-mf-lon-terminator-pT7-LacO-GFP-pdt-terminator-LE DNA were inserted to pQcascade-IS1. cisA-RepΔ33-T5 DNAP (D164A, E166A, A593R) DNA was and inserted to pBAD18. pTac-cisA initial site-moumou_P4Hc_cisA terminal site was synthesized and inserted to pET28a skelecton. Self-circularized ribozyme(Qu et al., 2022) and tRNA clusters were inserted to correspondingly CO3A1 and ELN plasmid. All these plasmids were constructed from the IGE Biotechnology Co., Ltd (Guangzhou, China).
ARTP-PANCE for screening intein and T7 RNAP mutants in vivo
Inactive kanamycin resistance gene recovery experiment
1 μg of MP4 plasmid was transformed into DH5a strain carrying with sfGFP-Kana (Q5X, L23X) plasmid. Monoclonal strain was cultured in LB medium with 10 mM L-arabinose 100 mg/L Ampicillin for 24 h at 37°C. Plasmid was extracted and transformed in BL21 (DE3) strain. DH5a strain carrying with sfGFP-Kana (Q5X, L23X) plasmid was mutated by ARTP for 30 s with 4 times in room temperature, then transfer it to 1 ml LB medium with 100 mg/L ampicillin. Cultured at 37 °C overnight, plasmid was extracted and transformed in BL21(DE3) strain equally. The transformed BL21(DE3) strain was coated on LB plates with 200 mg/L kanamycin and 0.1 mM IPTG. Clone number of GFP strain was calculated under the blue light.
Primary M13 phage preparation
DNA fragments of T7 RNAP (with primer T7 RNAP-F/R) and pIII-N-tripeptide-Mxe GyrA intein-N (with primer gIIIGyrA-NF/R) were amplified by High Fidelity DNA Polymerase (Yeasen Biotechnology) using synthesized gene as template. M13 genome skelecton without gIII was amplified using M13 phage as template, with primer M13-F/R. DNA fragments of T7 RNAP and pIII-N-tripeptide-Mxe GyrA intein-N were respectively cloned into M13 genome skelecton using Multi One Step Cloning Kit (Yeasen Biotechnology). Cloning product was transformed in to S1030 strain containing pJC175e plasmid. The supernatant was diluted and infected into fresh S1030-pJC175e strain to determinate titer using the bilayer agarose plate method (Miller et al., 2020). Monoclonal phage was picked into fresh S1030-pJC175e strain to amplification. Bacterial PCR was applied to identify the correctness of fragment insertion, and PCR product was further verified by Sanger sequencing. Primers were showed in Table S5.
Process of ARTP-PANCE
PANCE was processed according to previous report (Miller et al., 2020), we used the MP4 as mutagenesis plasmid. Every round of ARTP-PANCE was divided into three steps. Firstly, primary M13 phage of GyrA or T7 RNAP was added into 100 μL fresh S1030 strain containing corresponding accessory plasmid. After incubation, the bacterial precipitate was re-suspended and mutated by ARTP (30 s, 120 W) treatment for 4 times, and then diluted to 10 mL. The titer of M13 phage was determined by DNA-LAMP method using LAMP Kit (Yeasen Biotechnology). Reaction mix was incubated at 63 °C for 10-20 min. The mutant strain was cultured at the temperature of 37 °C and at the rotate speed of 100 rpm. Phage titer was detected using DNA LAMP every 2 h, until that phage titer was up to 106 CFU/mL. If the phage titer was still lower than 106 CFU/mL, S1030-pJC175e was employed in amplifying the M13 phage. Finally, progeny M13 phage was obtained by collect the supernatant. The progeny of initial M13 phage was used as primary phage in the next round of ARTP-PANCE.
Activity assay of Mxe GyrA intein
SDS-PACE and western blot were performed according to previous reported (Fang et al., 2023) using HRP-conjugated His-tag, Mouse mAb as antibody and Enhanced ECL Kit (Yeasen Biotechnology) as developer region. The ratio of cleaved pIII to uncleaved pIII represents the efficiency of Mxe GyrA intein.
Activity assay of T7 RNA polymerase
Accessory plasmid was transformed into BL21 strain which genome IS1 site was integrated with T7 RNAP. When OD600 value was up to 0.6, a final concentration of 0.5 mM IPTG was added to medium and the strain continue to be cultured overnight at the temperature of 37 °C and at the rotate speed of 200 rpm. Western blot of Anti-6*his tag demonstrated above was used to detect the expression level of CO3A1 protein. Total RNA of these strains was extracted using MolPure® Bacterial RNA Kit (Yeasen Biotechnology, Cat.No. 19301ES50). RT-qPCR was employed in detecting the RNA level of CO3A1 using Hifair® Ⅲ One Step RT-qPCR SYBR Green Kit (Yeasen Biotechnology, Cat.No. 11143ES50).
Collagen and elastin purification
Chromatography-free purification method for collagen tripeptide mediated by intein
BL21 (DE3) strain containing pET28a plasmid with GyrA-tripeptide or GyrA*-tripeptide was cultured in LB medium (pH 8.5) with 50 mg/L kanamycin. When OD600 value was up to 0.6, 0.3 mM IPTG was added and cultured overnight. Bacteria was resuspended by 50 ml PBS (pH 9.0, 0.5 M NaCl). After high-pressure distillation and centrifugation, the supernatant used a 10 KDa ultrafiltration tube to intercept proteins above 10 KDa. The pH of the shut-off solution was adjusted to 6-6.5 by hydrochloric acid. After incubated at 30 °C overnight, the reaction product used a 1 KDa ultrafiltration tube to obtain the peptide smaller than 1 KDa. The concentration and purity of fluid-through was tested by HPLC.
Collagen and elastin purification by affinity chromatography
When OD600 value was up to 0.6, a final concentration of 0.5 mM IPTG was added to medium and the strain continue to be cultured for overnight at the temperature of 37 °C and at the rotate speed of 200 rpm. After high-pressure distillation and centrifugation, the supernatant was filtered by a 0.22 μm membrane and purified by HisSep Ni-NTA 6FF Chromatography Column according to the instruction manual (Yeasen Biotechnology).
CRISPR-associated transposon
CRISPR-associated transposon was performed according to previous reported (Vo et al., 2021). Briefly, BL21 strain containing pTnsABC, pQcascade-IS1 and doner plasmid was cultured in LB medium with 50 mg/L kanamycin and 50 mg/L spectinomycin at OD600 value to 0.6. The stain was induced using 0.2 mM IPTG at 37 °C overnight, and then coated in plates to isolate monoclonal strains. Insert fragments were identified using bacterial PCR, and PCR product was further verified by Sanger sequencing. Positive clones were passaged multiple times in antibiotic-free medium to lose the pTnsABC, pQcascade-IS1 and doner plasmid, until it could not grow in LB medium containing kanamycin or spectinomycin.
TADR-FADS for screening proline hydroxylase mutants in vivo
GFP target degradation mediated by mf-lon protease and GFP-pdt system
IS1 site in BL21 genome was integrated with pTrc-mf-lon-terminator-pT7-LacO-GFP-pdt-terminator via CRISPR-associated transposases described above. This stain was transformed with plasmid of MP4-pTac-cisA initial site-moumou_P4Hc_cisA terminal site. When OD600 value was up to 0.6, a final concentration of 0.5 mM IPTG was added to medium and the strain continue to be cultured overnight at the temperature of 37 °C and at the rotate speed of 200 rpm. GFP fluorescence intensity of strains is determined using a fluorescent microplate reader.
Process of targeted artificial DNA replisome (TADR) technology
This BL21 (DE3) stain was transformed with plasmid of pBAD-cisA-RepΔ33-T5 DNAP (D164A, E166A, A593R) and pT7-cisA site-P4Hc. When OD600 value was up to 0.6, a final concentration of 20 mM L- arabinose was added to medium and the strain continue to be cultured for 24 h at the temperature of 37 °C and at the rotate speed of 200 rpm. The plasmid was extracted, and transformed into BL21 (DE3) strain which genome was integrated with mf-lon-pT7-LacO-GFP-pdt.
Preparation of microfluid chips and microfluid devices
Microfluidic chips and microfluid devices were built according to the methods previously reported (Beneyton and Rossignol, 2021; Mazutis et al., 2013). Briefly, microfluidic chips were prepared using bypoly(dimethylsiloxane) (PDMS). After degassing under vacuum condition, the mold was cross-linked, punched and bound to glass microscope slide. Finally, hydrophobic surface coating was created and electrode hole were filled with. The optical setup consisted of a Compound Inverted Microscope System (Olympus) mounted on a dampening platform with a 488 nm laser. Emitted light was captured through a 510 nm bandpass filter (510/20-25; Semrock Inc.). A Phantom v4.2 high speed digital camera (Vision Research) was loaded on the microscope to capture light images during droplet manipulation. A dropmaker chip was applied to generate 20 pL droplets at 4000 Hz. Under sorting mode operation, reloading module in which droplets were reloaded (0.2 μL min−1) and spaced-out at a flow-focusing junction with HFE7500 fluorinated oil (3 M) (1 μL min−1). The droplets were sorted by the optical setup at ∼400 Hz.
Screen proline hydroxylase mutants in vivo
The strains containing P4Hc mutant library was cultured in LB medium with 50 mg/L kanamycin. When OD600 value was up to 0.6, the strains were diluted by LB medium with 50 mg/L kanamycin to OD600 value at 0.05 (OD600 of 1 corresponded to 5 × 108 cells per mL), and used as the liquid phase in FADS described above.
Activity assay of proline hydroxylase
Total proteins of E. coli were extracted using Bacterial Protein Extraction Kit (Sangon Biotech, Cat.No. C600596) and quantitated by BCA Protein Quantification Kit (Yeasen Biotechnology, Cat.No. 20201ES76). Proline hydroxylation level was determined by Hydroxyproline (HYP) content detection kit (Solarbio Life Sciences, Cat.No. BC0250).
Improved CFPS Reaction to synthesize collagen and elastin
The CPFS system was designed according to the methods previously reported (Des Soye et al., 2019; Jewett et al., 2008). Specifically, an improved CFPS reaction was assembled by mixing the following components: 50 mM HEPES, pH 7.2; 3 mM ATP and GTP, 0.1 mM UTP, and CTP; 34 mg/mL folinic acid; 100 mg/mL of E. coli tRNA mixture; 15 mg/mL plasmid or DNA; 1 mM for each of the 20 standard amino acids; 5 mM glycine, proline and 2-oxoglutarate; 0.33 mM nicotinamide adenine dinucleotide (NAD); 0.27 mM coenzyme-A (CoA); 1 mM putrescine; 4 mM sodium oxalate; 1.5 mM spermidine; 130 mM potassium glutamate; 10 mM ammonium glutamate; 12 mM magnesium glutamate; 33 mM phosphoenolpyruvate (PEP), and 30% v/v of cell extract. CFPS reaction was incubated for 8 h at 37 °C.