当前位置:文档之家› 生物信息学作业1.doc

生物信息学作业1.doc

生物信息学实验作业试验一一.找到编码拟南芥(arabidopsis)phyA(光敏色素A)基因的核酸序列编号, 并记录查找过程。

GI:224576211步骤1.进入NCBI主页2.搜索arabidopsis phyA3.Arabidopsis thaliana phytochrome A (PHYA) gene, partial cds4.VERSION:GI:224576211二.以phyA为检索词,在pubmed数据库中分别检索在题目和关键词字段中含有该检索词的文献,记录检索出的条目数目。

Results: 614三.仔细阅读所查询核酸序列在NCBI和EMBL数据库中格式的解释,理解各字段的含义,并比较NCBI 与EMBL中序列格式的异同。

实验二一.分析你感兴趣核酸序列的分子质量、碱基组成。

Composition 35 A; 25 C; 35 G; 15 T; 0 OTHERPercentage: 32% A; 23% C; 32% G; 14% T; 0%OTHERMolecular Weight (kDa): ssDNA: 34.26 dsDNA: 67.8二.列出你所分析核酸序列(或部分序列)的互补序列、反向序列、反向互补序列、DNA双链序列和RNA 序列。

R S1 ACTACTCGAG AAGCAGCGAC AGAGGCGTTA GCCCGCTCAG CAGACTGGCA GTTCTCTACC61 GACAAAAAAG AGGTAGGAGG CACAGTAATG ATACAGGCGT AGCAGGAGGGC S1 CCCTCCTGCT ACGCCTGTAT CATTACTGTG CCTCCTACCT CTTTTTTGTC GGTAGAGAAC61 TGCCAGTCTG CTGAGCGGGC TAACGCCTCT GTCGCTGCTT CTCGAGTAGTR C S1 TGATGAGCTC TTCGTCGCTG TCTCCGCAAT CGGGCGAGTC GTCTGACCGT CAAGAGATGG61 CTGTTTTTTC TCCATCCTCC GTGTCATTAC TATGTCCGCA TCGTCCTCCCD DNA S1 GGGAGGACGA TGCGGACATA GTAATGACAC GGAGGATGGA GAAAAAACAG CCATCTCTTGCCCTCCTGCT ACGCCTGTAT CATTACTGTG CCTCCTACCT CTTTTTTGTC GGTAGAGAAC61 ACGGTCAGAC GACTCGCCCG ATTGCGGAGA CAGCGACGAA GAGCTCATCATGCCAGTCTG CTGAGCGGGC TAACGCCTCT GTCGCTGCTT CTCGAGTAGTRNA S1 GGGAGGACGA UGCGGACAUA GUAAUGACAC GGAGGAUGGA GAAAAAACAG CCAUCUCUUG61 ACGGUCAGAC GACUCGCCCG AUUGCGGAGA CAGCGACGAA GAGCUCAUCA三.列出核酸序列的限制性酶切位点分析结果(酶及识别位点)。

Restriction analysis on USMethylation: dam-No dcm-NoScreened with 117 enzymes, 5 sites foundEcl136II 1 GAG/CTC103EcoICRI 1 GAG/CTC103SacI 1 GAGCT/C105SapI 1 GCTCTTCN/93SstI 1 GAGCT/C105List by Site Order93 SapI 103 Ecl136II 105 SstI 105 SacI103 EcoICRINon Cut EnzymesAatII Acc65I AccIII AclI AflII AgeIAhaIII Alw44I AlwNI ApaBI ApaI ApaLIAscI Asp718I AsuII AvrII BalI BamHIBbeI BbvII BclI BglI BglII Bpu1102IBsc91I BsiI BsmI Bsp1407I BspHI BspMIBspMII BssHII BstD102I BstEII BstXI Bsu36IClaI Csp45I CspI CvnI DraI DraIIIDrdI EagI Eam1105I Eco31I Eco47III Eco52IEco56I Eco57I Eco72I EcoNI EcoRI EcoRVEheI EspI FseI HindIII HpaI I-PpoIKpnI MfeI Mlu113I MluI MscI MstIMstII NaeI NarI NcoI NdeI NheINotI NruI NsiI PacI PflMI PinAIPmaCI PmeI PstI PvuI PvuII RleAISacII SalI SauI ScaI SciI SfiISgrAI SmaI SnaBI SpeI SphI SplISpoI SrfI SspI SstII StuI SunISwaI Tth111I VspI XbaI XcmI XhoIXmaI XmaIII XmnI XorIIRestriction sites on US1 GGGAGGACGATGCGGACATAGTAATGACACGGAGGATGGAGAAAAAACAGCCATCTCTTGSacISstIEcl136IISapI EcoICRI61 ACGGTCAGACGACTCGCCCGATTGCGGAGACAGCGACGAAGAGCTCATCA四.分析一对你所设计的引物,并对其进行综合评判。

2 GGAGGACGATGCGGACATAOligo: 5'-GGAGGACGATGCGGACATA-3'Primer1: 19 basesComposition 6 A; 3 C; 8 G; 2 T; 0 OTHERPercentage: 31% A; 15% C; 42% G; 10% T; 0%OTHERMW=5.99 kDaHybridization: D:DSalt: 50 mMFormamide: 0%Mismatch: 0 bpThermo Tm = 62.0 Hybridization Tm = 52.1 GC+AT Tm = 60.0 Primer-US(1-110) complementarity.First complementarity in continuous: 19 bp5'-GGAGGACGATGCGGACATA-3' Primer|||||||||||||||||||3'-CCTCCTGCTACGCCTGTAT-5' (20) Strand -No second possible complementarityMax complementarity in discontinuous: 19 bp5'-GGAGGACGATGCGGACATA-3' Primer|||||||||||||||||||3'-CCTCCTGCTACGCCTGTAT-5' (20) Strand -105 AGCTCTTCGTCGCTGTCTCCOligo: 5'-AGCTCTTCGTCGCTGTCTCC-3'Primer1: 20 basesComposition 1 A; 8 C; 4 G; 7 T; 0 OTHER Percentage: 5% A; 40% C; 20% G; 35% T; 0%OTHER MW=6.07 kDaHybridization: D:DSalt: 50 mMFormamide: 0%Mismatch: 0 bpThermo Tm = 62.2 Hybridization Tm = 54.5 GC+AT Tm = 64.0 Primer-US(1-110) complementarity.First complementarity in continuous: 20 bp5'-AGCTCTTCGTCGCTGTCTCC-3' Primer||||||||||||||||||||3'-TCGAGAAGCAGCGACAGAGG-5' (86) Strand +No second possible complementarityMax complementarity in discontinuous: 20 bp5'-AGCTCTTCGTCGCTGTCTCC-3' Primer||||||||||||||||||||3'-TCGAGAAGCAGCGACAGAGG-5' (86) Strand +五.运用Sequin软件进行序列提交,并打印你完成的序列提交文件(后缀为.sqn)。

LOCUS GY482612 110 bp mRNA linear UNA 17-FEB-2002 DEFINITION Sequence 33 from patent US 8030290.ACCESSION GY482612VERSION GY482612.1 GI:353292184KEYWORDS .SOURCE unidentifiedORGANISM unidentifiedunclassified sequences.REFERENCE 1 (bases 1 to 110)AUTHORS chen,h.TITLE Sequence 33 from patent US 8030290JOURNAL UnpublishedREFERENCE 2 (bases 1 to 110)AUTHORS chen,h.TITLE Direct SubmissionJOURNAL Submitted (17-FEB-2002) SCAU, Bio, yucheng, yanan, sichuan, Chinai FEATURES Location/Qualifierssource 1..110/organism="unidentified"/mol_type="mRNA"/db_xref="taxon:32644"BASE COUNT 35 a 25 c 35 g 15 tORIGIN1 gggaggacga tgcggacata gtaatgacac ggaggatgga gaaaaaacag ccatctcttg61 acggtcagac gactcgcccg attgcggaga cagcgacgaa gagctcatca//。

相关主题