BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= I09A02NGRL0002_I05
         (586 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits) Value

UniRef50_UPI0000DB7985 Cluster: PREDICTED: similar to Collagen t...   164   2e-39
UniRef50_Q4SZ73 Cluster: Chromosome undetermined SCAF11805, whol...   150   2e-35
UniRef50_UPI0000613E3C Cluster: Collagen alpha-2(IV) chain; n=2;...   149   5e-35
UniRef50_P08120 Cluster: Collagen alpha-1(IV) chain precursor; n...   149   5e-35
UniRef50_P08572 Cluster: Collagen alpha-2(IV) chain precursor [C...   149   6e-35
UniRef50_P29400 Cluster: Collagen alpha-5(IV) chain precursor; n...   148   1e-34
UniRef50_Q07265 Cluster: 3 alpha procollagen; n=4; Strongylocent...   142   7e-33
UniRef50_P17139-2 Cluster: Isoform b of P17139 ; n=2; Caenorhabd...   137   2e-31
UniRef50_P55787 Cluster: Collagen alpha-4(IV) chain; n=46; Eumet...   134   1e-30
UniRef50_UPI00015B49AB Cluster: PREDICTED: similar to alpha-5 ty...   134   1e-30
UniRef50_P53420 Cluster: Collagen alpha-4(IV) chain precursor; n...   134   2e-30
UniRef50_Q26640 Cluster: Alpha2(IV)-like collagen; n=4; Strongyl...   133   3e-30
UniRef50_Q4SZ69 Cluster: Chromosome undetermined SCAF11805, whol...   105   1e-29
UniRef50_Q28084 Cluster: Collagen alpha-3(IV) chain; n=13; cellu...   129   4e-29
UniRef50_Q01955 Cluster: Collagen alpha-3(IV) chain precursor (G...   127   2e-28
UniRef50_Q58FS7 Cluster: Type IV collagen alpha 3 chain; n=2; Da...   126   5e-28
UniRef50_Q4S0I4 Cluster: Chromosome 2 SCAF14781, whole genome sh...   122   8e-27
UniRef50_Q7PVR6 Cluster: ENSANGP00000016652; n=3; Endopterygota|...   122   8e-27
UniRef50_Q9VMV5 Cluster: CG16858-PA; n=6; Schizophora|Rep: CG168...   120   3e-26
UniRef50_UPI000065E567 Cluster: Homolog of Brachydanio rerio "Co...   116   3e-25
UniRef50_UPI000065E566 Cluster: Homolog of Homo sapiens "Splice ...   115   7e-25
UniRef50_UPI00006608B5 Cluster: Homolog of Homo sapiens "Tumstat...   114   1e-24
UniRef50_Q9GQB1 Cluster: Type IV collagen alpha 1 chain precurso...   111   9e-24
UniRef50_UPI00015A592A Cluster: Type IV collagen alpha 4 chain; ...    95   1e-18
UniRef50_Q4TZW9 Cluster: Type IV collagen alpha 4 chain; n=3; Da...    95   1e-18
UniRef50_Q5C3P1 Cluster: SJCHGC06113 protein; n=2; Platyhelminth...    87   4e-16
UniRef50_O09238 Cluster: Collagen type IV; n=2; Pseudocorticium ...    84   2e-15
UniRef50_UPI0000DA44CD Cluster: PREDICTED: similar to procollage...    68   1e-10
UniRef50_UPI0000DBF028 Cluster: UPI0000DBF028 related cluster; n...    68   1e-10
UniRef50_Q5BYE6 Cluster: SJCHGC08138 protein; n=1; Schistosoma j...    57   3e-07
UniRef50_Q4SB07 Cluster: Chromosome undetermined SCAF14677, whol...    54   2e-06
UniRef50_A7T795 Cluster: Predicted protein; n=1; Nematostella ve...    47   4e-04
UniRef50_A7T3G2 Cluster: Predicted protein; n=1; Nematostella ve...    47   4e-04
UniRef50_A1ZSZ5 Cluster: Putative uncharacterized protein; n=1; ...    34   2.8
UniRef50_Q24BC9 Cluster: Putative uncharacterized protein; n=1; ...
33 5.0 >UniRef50_UPI0000DB7985 Cluster: PREDICTED: similar to Collagen type IV CG4145-PA, isoform A isoform 1; n=1; Apis mellifera|Rep: PREDICTED: similar to Collagen type IV CG4145-PA, isoform A isoform 1 - Apis mellifera Length = 1913 Score = 164 bits (398), Expect = 2e-39 Identities = 69/110 (62%), Positives = 83/110 (75%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ+L+IP CP GW+ LWIGYSF+MHT L+S GSCLEDFRA PFIECNG Sbjct: 1804 HSQSLNIPDCPQGWTGLWIGYSFLMHTGAGAQGGGQSLSSSGSCLEDFRATPFIECNGNK 1863 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIKNT 330 G CH++ N++SFW+ TIED QQF PE+QTLK+G L ++SRC VCIKNT Sbjct: 1864 GQCHYYMNEISFWMATIEDRQQFQAPEQQTLKAGNLRSKISRCQVCIKNT 1913 Score = 34.3 bits (75), Expect(2) = 0.005 Identities = 21/67 (31%), Positives = 34/67 (50%), Gaps = 2/67 (2%) Frame = +1 Query: 124 GSCLEDFRAIPFIECNGEGGTCHHFANK--LSFWLTTIEDSQQFAMPERQTLKSGRLLER 297 GSC+ F +PF+ C+ C H+ N+ S+WL+T MP ++ + + Sbjct: 1736 GSCVRKFSTMPFLFCD-INNVC-HYGNRGDRSYWLSTTSPIPM--MP----VQESEIEQY 1787 Query: 298 VSRCAVC 318 +SRC VC Sbjct: 1788 ISRCVVC 1794 Score = 27.9 bits (59), Expect(2) = 0.005 Identities = 11/24 (45%), Positives = 15/24 (62%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFV 72 HSQ+ +P C G +LW GYS + Sbjct: 1672 HSQSQLLPVCDAGHIKLWEGYSLL 1695 >UniRef50_Q4SZ73 Cluster: Chromosome undetermined SCAF11805, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF11805, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1026 Score = 150 bits (364), Expect = 2e-35 Identities = 66/109 (60%), Positives = 77/109 (70%), Gaps = 1/109 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ + IP CP GW LWIGYSF+MHT L SPGSCLEDFRA PFIECNG Sbjct: 918 HSQDMTIPTCPPGWRSLWIGYSFLMHTAAGAEGGGQSLVSPGSCLEDFRATPFIECNGAK 977 Query: 181 GTCHHFANKLSFWLTTIEDSQQ-FAMPERQTLKSGRLLERVSRCAVCIK 324 GTCH+FANK SFWLTT++ +Q+ F P ++TLK G+ +VSRC VC K Sbjct: 978 
GTCHYFANKYSFWLTTVDPNQEFFYSPSQETLKGGQERSKVSRCQVCSK 1026 Score = 64.9 bits (151), Expect = 1e-09 Identities = 35/106 (33%), Positives = 50/106 (47%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ +P CP G ++LW GYS +++ L PGSCL F IPF+ C+ Sbjct: 810 HSQDAQVPMCPQGMAKLWDGYS-LLYVEGQEKAHNQDLGQPGSCLPRFSTIPFLYCSPNE 868 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 + N S+WL+T + E Q + +SRC+VC Sbjct: 869 VCYYASRNDKSYWLSTTASIPMMPVAEAQ------IQAYISRCSVC 908 >UniRef50_UPI0000613E3C Cluster: Collagen alpha-2(IV) chain; n=2; Bos taurus|Rep: Collagen alpha-2(IV) chain - Bos Taurus Length = 227 Score = 149 bits (361), Expect = 5e-35 Identities = 64/109 (58%), Positives = 73/109 (66%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ + IP CP GW LWIGYSF+MHT L SPGSCLEDFRA PFIECNG Sbjct: 118 HSQDVSIPHCPAGWRSLWIGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGAR 177 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIKN 327 GTCH++ANK SFWLTTI + P TLK+G + +SRC VC+KN Sbjct: 178 GTCHYYANKYSFWLTTIPEQSFQGTPSADTLKAGLIRTHISRCQVCMKN 226 Score = 62.9 bits (146), Expect = 5e-09 Identities = 37/107 (34%), Positives = 53/107 (49%), Gaps = 1/107 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT P CPVG ++LW GYS +++ L GSCL F +PF+ CN G Sbjct: 10 HSQTDKEPMCPVGMNKLWSGYS-LLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCN-PG 67 Query: 181 GTCHHFA-NKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 C++ + N S+WL+T + E + +SRC+VC Sbjct: 68 DVCYYASRNDKSYWLSTTAPLPMMPVAEED------IRPYISRCSVC 108 >UniRef50_P08120 Cluster: Collagen alpha-1(IV) chain precursor; n=5; Diptera|Rep: Collagen alpha-1(IV) chain precursor - Drosophila melanogaster (Fruit fly) Length = 1775 Score = 149 bits (361), Expect = 5e-35 Identities = 62/111 (55%), Positives = 76/111 (68%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT+++P CP GW LWIGYSF+MHT L SPGSCLEDFRA PFIECNG Sbjct: 1665 
HSQTIEVPDCPNGWEGLWIGYSFLMHTAVGNGGGGQALQSPGSCLEDFRATPFIECNGAK 1724 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIKNTT 333 GTCH + SFW+ +E SQ F P++QT+K+G VSRC VC+KN++ Sbjct: 1725 GTCHFYETMTSFWMYNLESSQPFERPQQQTIKAGERQSHVSRCQVCMKNSS 1775 Score = 56.4 bits (130), Expect = 5e-07 Identities = 34/107 (31%), Positives = 55/107 (51%), Gaps = 1/107 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ+ +P C G +ELW GYS +++ L GSC+ F +P + C G+ Sbjct: 1560 HSQSETVPACSAGHTELWTGYS-LLYVDGNDYAHNQDL---GSCVPRFSTLPVLSC-GQN 1614 Query: 181 GTCHHFA-NKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 C++ + N +FWLTT A+P +++ + + +SRC VC Sbjct: 1615 NVCNYASRNDKTFWLTT-----NAAIP-MMPVENIEIRQYISRCVVC 1655 >UniRef50_P08572 Cluster: Collagen alpha-2(IV) chain precursor [Contains: Canstatin]; n=48; Tetrapoda|Rep: Collagen alpha-2(IV) chain precursor [Contains: Canstatin] - Homo sapiens (Human) Length = 1712 Score = 149 bits (360), Expect = 6e-35 Identities = 64/109 (58%), Positives = 73/109 (66%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ + IP CP GW LWIGYSF+MHT L SPGSCLEDFRA PFIECNG Sbjct: 1603 HSQDVSIPHCPAGWRSLWIGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIECNGGR 1662 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIKN 327 GTCH++ANK SFWLTTI + P TLK+G + +SRC VC+KN Sbjct: 1663 GTCHYYANKYSFWLTTIPEQSFQGSPSADTLKAGLIRTHISRCQVCMKN 1711 Score = 63.3 bits (147), Expect = 4e-09 Identities = 37/107 (34%), Positives = 54/107 (50%), Gaps = 1/107 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT P CPVG ++LW GYS +++ L GSCL F +PF+ CN G Sbjct: 1495 HSQTDQEPMCPVGMNKLWSGYS-LLYFEGQEKAHNQDLGLAGSCLARFSTMPFLYCN-PG 1552 Query: 181 GTCHHFA-NKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 C++ + N S+WL+T + E + + +SRC+VC Sbjct: 1553 DVCYYASRNDKSYWLSTTAPLPMMPVAEDE------IKPYISRCSVC 1593 >UniRef50_P29400 Cluster: Collagen alpha-5(IV) chain precursor; n=61; Eumetazoa|Rep: Collagen alpha-5(IV) chain 
precursor - Homo sapiens (Human) Length = 1685 Score = 148 bits (358), Expect = 1e-34 Identities = 63/110 (57%), Positives = 79/110 (71%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT+ IP CP GW LWIGYSF+MHT LASPGSCLE+FR+ PFIEC+G Sbjct: 1577 HSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSCLEEFRSAPFIECHGR- 1635 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIKNT 330 GTC+++AN SFWL T++ S F+ P+ +TLK+G L R+SRC VC+K T Sbjct: 1636 GTCNYYANSYSFWLATVDVSDMFSKPQSETLKAGDLRTRISRCQVCMKRT 1685 Score = 62.1 bits (144), Expect = 9e-09 Identities = 38/107 (35%), Positives = 52/107 (48%), Gaps = 1/107 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT D P CP G +++ G+S +++ L + GSCL F +PF+ CN Sbjct: 1467 HSQTTDAPQCPQGTLQVYEGFS-LLYVQGNKRAHGQDLGTAGSCLRRFSTMPFMFCNINN 1525 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMP-ERQTLKSGRLLERVSRCAVC 318 N S+WL+T E MP Q LK + +SRCAVC Sbjct: 1526 VCNFASRNDYSYWLSTPE-----PMPMSMQPLKGQSIQPFISRCAVC 1567 >UniRef50_Q07265 Cluster: 3 alpha procollagen; n=4; Strongylocentrotus purpuratus|Rep: 3 alpha procollagen - Strongylocentrotus purpuratus (Purple sea urchin) Length = 1752 Score = 142 bits (343), Expect = 7e-33 Identities = 61/109 (55%), Positives = 80/109 (73%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT++IP CP W LWIGYSF+MHT L+SPGSCLEDFR+ PFIEC+G+ Sbjct: 1644 HSQTVNIPDCPDRWGVLWIGYSFMMHTGPGGEGSGQMLSSPGSCLEDFRSSPFIECHGD- 1702 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIKN 327 G C+++A +FWL++I + QF MP+ +TLK+G L RVSRCAVC++N Sbjct: 1703 GKCNYYATTYTFWLSSITGNAQFTMPQSETLKAGSLRTRVSRCAVCLRN 1751 Score = 73.7 bits (173), Expect = 3e-12 Identities = 40/107 (37%), Positives = 55/107 (51%), Gaps = 1/107 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT IP CP G +++W GYS ++ L PGSCL+ F +PF+ CN Sbjct: 1534 HSQTTSIPQCPQGTAKMWHGYS-LLFVQGNERGHGQDLGKPGSCLKRFSTMPFLFCN-IN 1591 Query: 181 
GTCHHFA-NKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 CH + N S+WL+T E P ++ G+L +SRC VC Sbjct: 1592 NVCHVASRNDYSYWLSTTEPMPMNMAP----IRGGQLQPFISRCVVC 1634 >UniRef50_P17139-2 Cluster: Isoform b of P17139 ; n=2; Caenorhabditis elegans|Rep: Isoform b of P17139 - Caenorhabditis elegans Length = 1502 Score = 137 bits (332), Expect = 2e-31 Identities = 59/109 (54%), Positives = 74/109 (67%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ +P CP GWS +W GYSFVMHT L SPGSCLE+FRA+PFIEC+G Sbjct: 1394 HSQDTSVPQCPQGWSGMWTGYSFVMHTAAGAEGTGQSLQSPGSCLEEFRAVPFIECHGR- 1452 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIKN 327 GTC+++A FW + ++ +QF P QTLK+G L +RVSRC VC+KN Sbjct: 1453 GTCNYYATNHGFWPSIVDQDKQFRKPMSQTLKAGGLKDRVSRCQVCLKN 1501 Score = 73.3 bits (172), Expect = 4e-12 Identities = 42/107 (39%), Positives = 52/107 (48%), Gaps = 1/107 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT +P CP G S+LW GYS +++ L PGSCL F +PF+ CN Sbjct: 1284 HSQTTAVPQCPPGASQLWEGYS-LLYVQGNGRASGQDLGQPGSCLSKFNTMPFMFCN-MN 1341 Query: 181 GTCH-HFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 CH N SFWL+T E P T + +SRCAVC Sbjct: 1342 SVCHVSSRNDYSFWLSTDEPMTPMMNPVTGT----AIRPYISRCAVC 1384 >UniRef50_P55787 Cluster: Collagen alpha-4(IV) chain; n=46; Eumetazoa|Rep: Collagen alpha-4(IV) chain - Oryctolagus cuniculus (Rabbit) Length = 623 Score = 134 bits (325), Expect = 1e-30 Identities = 61/110 (55%), Positives = 71/110 (64%), Gaps = 2/110 (1%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ IP CP W LWIGYSF+MHT L SPGSCLEDFRA PF+EC G Sbjct: 512 HSQDQSIPPCPRAWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDFRAAPFLECQGRQ 571 Query: 181 GTCHHFANKLSFWLTTI-EDSQQFAMPERQTLKSGRL-LERVSRCAVCIK 324 GTCH FAN+ SFWLTT+ D Q F+ P TLK + +++SRC VC+K Sbjct: 572 GTCHFFANEYSFWLTTVPPDLQVFSAPSPDTLKESQAQRQKISRCQVCVK 621 Score = 59.3 bits (137), Expect = 7e-08 Identities = 36/107 (33%), Positives = 48/107 (44%), Gaps = 1/107 (0%) 
Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT P CP+G LW GYS +++ L GSCL F +PF CN Sbjct: 404 HSQTDQEPACPMGMPRLWTGYS-LLYLEGQEKAHNQDLGLAGSCLPIFSTLPFAYCNIH- 461 Query: 181 GTCHHF-ANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 CH+ N S+WL + + E + + +SRCAVC Sbjct: 462 QVCHYAQRNDKSYWLASAGPLPMMPLSEEE------IRPYISRCAVC 502 >UniRef50_UPI00015B49AB Cluster: PREDICTED: similar to alpha-5 type IV collagen; n=1; Nasonia vitripennis|Rep: PREDICTED: similar to alpha-5 type IV collagen - Nasonia vitripennis Length = 1702 Score = 134 bits (324), Expect = 1e-30 Identities = 59/108 (54%), Positives = 77/108 (71%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ++ IP CP GW ELW GYSF+MH L+SPGSCLE+FRA PFIEC G+ Sbjct: 1490 HSQSMAIPECPGGWEELWAGYSFLMHRDAGAAGGGQPLSSPGSCLEEFRARPFIECRGQ- 1548 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIK 324 GTC+ F+ +S+W+ TI+D +QF P++QTLK+ RVSRC+VCI+ Sbjct: 1549 GTCNFFSTAVSYWMATIKDYEQFRKPQQQTLKTDH-TSRVSRCSVCIR 1595 Score = 60.5 bits (140), Expect = 3e-08 Identities = 34/106 (32%), Positives = 51/106 (48%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ+ IP CP ++W G+S ++H L +PGSCL+ F +PF CN Sbjct: 1380 HSQSAMIPVCPRNTVKMWDGFS-LLHVMGNSYAHAQDLGTPGSCLKKFSVMPFNVCNLNN 1438 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 + N S+WL++ E P + S + +SRC+VC Sbjct: 1439 VCDYANRNDYSYWLSSNEQMPMSMTP----IPSREVGAYISRCSVC 1480 >UniRef50_P53420 Cluster: Collagen alpha-4(IV) chain precursor; n=36; Euteleostomi|Rep: Collagen alpha-4(IV) chain precursor - Homo sapiens (Human) Length = 1690 Score = 134 bits (323), Expect = 2e-30 Identities = 61/110 (55%), Positives = 71/110 (64%), Gaps = 2/110 (1%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ IP CP W LWIGYSF+MHT L SPGSCLEDFRA PF+EC G Sbjct: 1579 HSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDFRAAPFLECQGRQ 1638 Query: 181 
GTCHHFANKLSFWLTTIEDSQQF-AMPERQTLKSGRL-LERVSRCAVCIK 324 GTCH FANK SFWLTT++ QF + P TLK + +++SRC VC+K Sbjct: 1639 GTCHFFANKYSFWLTTVKADLQFSSAPAPDTLKESQAQRQKISRCQVCVK 1688 Score = 58.0 bits (134), Expect = 2e-07 Identities = 39/107 (36%), Positives = 48/107 (44%), Gaps = 1/107 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT P CP+G LW GYS +++ L GSCL F +PF CN Sbjct: 1471 HSQTDQEPTCPLGMPRLWTGYS-LLYLEGQEKAHNQDLGLAGSCLPVFSTLPFAYCNIH- 1528 Query: 181 GTCHHF-ANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 CH+ N S+WL + MP L + VSRCAVC Sbjct: 1529 QVCHYAQRNDRSYWLASAAPLPM--MP----LSEEAIRPYVSRCAVC 1569 >UniRef50_Q26640 Cluster: Alpha2(IV)-like collagen; n=4; Strongylocentrotus purpuratus|Rep: Alpha2(IV)-like collagen - Strongylocentrotus purpuratus (Purple sea urchin) Length = 1747 Score = 133 bits (322), Expect = 3e-30 Identities = 61/108 (56%), Positives = 74/108 (68%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ+ +IP CP GW LW GYSF M+T L S GSCLE+FRA PFIECNG G Sbjct: 1640 HSQSQEIPQCPGGWRSLWTGYSFTMYTAASEGGGQG-LESVGSCLENFRATPFIECNGRG 1698 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIK 324 CH F+N+ SFWLT I++ QFA+P ++T+KSG+L VSRC VC K Sbjct: 1699 N-CHFFSNEYSFWLTVIDEEDQFAIPRKRTIKSGQLQSVVSRCRVCQK 1745 Score = 65.7 bits (153), Expect = 8e-10 Identities = 38/106 (35%), Positives = 53/106 (50%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ+ ++P CP G ELW G+S V+ + L GSCL+ F +PF+ CN Sbjct: 1532 HSQSRNVPSCPAGTVELWRGFS-VLFSMGNGHAHHQDLGDAGSCLQRFSTMPFLFCNFNN 1590 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 + N S+WLTT E MP L + ++ +SRC VC Sbjct: 1591 VCNYASRNDRSYWLTTNEPLPM--MP----LMNQQIDPYISRCTVC 1630 >UniRef50_Q4SZ69 Cluster: Chromosome undetermined SCAF11805, whole genome shotgun sequence; n=2; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF11805, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 471 
Score = 105 bits (251), Expect(2) = 1e-29 Identities = 46/85 (54%), Positives = 58/85 (68%) Frame = +1 Query: 76 HTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEGGTCHHFANKLSFWLTTIEDSQQFAM 255 HT LASPGSCLE+FR+ PFIEC+G G TC+++ N SFWL T+E S+ F Sbjct: 388 HTSAGAEGSGQALASPGSCLEEFRSAPFIECHGRG-TCNYYGNSYSFWLATVEPSEMFRK 446 Query: 256 PERQTLKSGRLLERVSRCAVCIKNT 330 P+ +TLK+G L RVSRC VC+K T Sbjct: 447 PQSETLKAGNLQTRVSRCVVCMKRT 471 Score = 58.4 bits (135), Expect = 1e-07 Identities = 36/107 (33%), Positives = 52/107 (48%), Gaps = 1/107 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ D+P CP G + ++ GYS +++ L + GSCL F +PF+ CN Sbjct: 213 HSQAQDVPYCPDGTNLIYDGYS-LLYVQGNERAHGQDLGTAGSCLRRFSTMPFMFCNINN 271 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLER-VSRCAVC 318 N S+WL+T E MP +G ++ +SRCAVC Sbjct: 272 VCNFASRNDYSYWLSTPE-----PMPMSMAPITGESIKPFISRCAVC 313 Score = 47.2 bits (107), Expect(2) = 1e-29 Identities = 17/25 (68%), Positives = 19/25 (76%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVM 75 HSQT+ IP CP W LWIGYSF+M Sbjct: 323 HSQTIQIPTCPANWEALWIGYSFMM 347 >UniRef50_Q28084 Cluster: Collagen alpha-3(IV) chain; n=13; cellular organisms|Rep: Collagen alpha-3(IV) chain - Bos taurus (Bovine) Length = 471 Score = 129 bits (312), Expect = 4e-29 Identities = 56/108 (51%), Positives = 72/108 (66%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT DIP CP GW LW G+SF+M T LASPGSCLE+FRA PFIEC+G Sbjct: 362 HSQTTDIPPCPAGWISLWKGFSFIMFTSAGSEGAGQALASPGSCLEEFRASPFIECHGR- 420 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIK 324 GTC++++N SFWL +++ + F P T+K+G L +SRC VC+K Sbjct: 421 GTCNYYSNSYSFWLASLDPKRMFRKPIPSTVKAGELENIISRCQVCMK 468 Score = 57.6 bits (133), Expect = 2e-07 Identities = 37/107 (34%), Positives = 50/107 (46%), Gaps = 1/107 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT IP CP G L+ G+S ++ L + GSCL+ F +PF+ CN Sbjct: 252 
HSQTTAIPSCPEGTEPLYSGFS-LLFVQGNEQAHGQDLGTLGSCLQRFTTMPFLFCNIND 310 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLE-RVSRCAVC 318 N S+WL+T +P +GR LE +SRC VC Sbjct: 311 VCNFASRNDYSYWLST-----PAMIPMDMAPITGRALEPYISRCTVC 352 >UniRef50_Q01955 Cluster: Collagen alpha-3(IV) chain precursor (Goodpasture antigen) [Contains: Tumstatin]; n=61; Eumetazoa|Rep: Collagen alpha-3(IV) chain precursor (Goodpasture antigen) [Contains: Tumstatin] - Homo sapiens (Human) Length = 1670 Score = 127 bits (306), Expect = 2e-28 Identities = 55/108 (50%), Positives = 72/108 (66%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT DIP CP GW LW G+SF+M T LASPGSCLE+FRA PF+EC+G Sbjct: 1561 HSQTTDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLECHGR- 1619 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIK 324 GTC++++N SFWL ++ + F P T+K+G L + +SRC VC+K Sbjct: 1620 GTCNYYSNSYSFWLASLNPERMFRKPIPSTVKAGELEKIISRCQVCMK 1667 Score = 61.7 bits (143), Expect = 1e-08 Identities = 39/107 (36%), Positives = 50/107 (46%), Gaps = 1/107 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT IP CP G L+ G+SF + L + GSCL+ F +PF+ CN Sbjct: 1451 HSQTTAIPSCPEGTVPLYSGFSF-LFVQGNQRAHGQDLGTLGSCLQRFTTMPFLFCNVND 1509 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLE-RVSRCAVC 318 N S+WL+T MP +GR LE +SRC VC Sbjct: 1510 VCNFASRNDYSYWLST-----PALMPMNMAPITGRALEPYISRCTVC 1551 >UniRef50_Q58FS7 Cluster: Type IV collagen alpha 3 chain; n=2; Danio rerio|Rep: Type IV collagen alpha 3 chain - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 244 Score = 126 bits (303), Expect = 5e-28 Identities = 57/108 (52%), Positives = 71/108 (65%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT++IP CPVGW LW GYSFVM T L SPGSCLE FR IPFIEC+G Sbjct: 133 HSQTINIPQCPVGWLSLWEGYSFVMQTGVGAEGSGQPLVSPGSCLEQFRKIPFIECHGR- 191 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIK 324 GTC+ + + S+WL +++ + F+MP RQT K E +SRC VC+K Sbjct: 
192 GTCNFYPDSYSYWLASLDHTNMFSMPNRQTAKQ---KEIISRCQVCMK 236 Score = 65.3 bits (152), Expect = 1e-09 Identities = 41/108 (37%), Positives = 61/108 (56%), Gaps = 2/108 (1%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT IP CP G L+ GYS ++ L + GSCL F +PF+ CN + Sbjct: 23 HSQTTVIPECPAGSKRLYTGYS-LLFINGNNRGHGQDLGTLGSCLPMFNTMPFMVCNRD- 80 Query: 181 GTCHHFA-NKLSFWLTTIEDSQQFAMPERQTLKSGRLLE-RVSRCAVC 318 TC + + N S+WL+T D+ +P++Q + SG +L+ +SRC+VC Sbjct: 81 ETCRYASRNDYSYWLST--DTPM--LPDQQ-MMSGEILKWYISRCSVC 123 >UniRef50_Q4S0I4 Cluster: Chromosome 2 SCAF14781, whole genome shotgun sequence; n=5; Euteleostomi|Rep: Chromosome 2 SCAF14781, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1468 Score = 122 bits (293), Expect = 8e-27 Identities = 50/78 (64%), Positives = 58/78 (74%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ + IP CPVGW LWIGYSF+MHT L+SPGSCLEDFR PFIECNG Sbjct: 1390 HSQDITIPQCPVGWRSLWIGYSFLMHTAAGNEGGGQSLSSPGSCLEDFRTTPFIECNGAK 1449 Query: 181 GTCHHFANKLSFWLTTIE 234 GTCH+FANK SFWL++++ Sbjct: 1450 GTCHYFANKHSFWLSSVD 1467 Score = 65.7 bits (153), Expect = 8e-10 Identities = 39/107 (36%), Positives = 56/107 (52%), Gaps = 1/107 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT IP CPVG ++LW GYS +++ L GSCL F +PF+ CN G Sbjct: 1282 HSQTEQIPMCPVGMAKLWSGYS-LLYMEGQEKAHNQDLGLAGSCLPRFSTMPFLYCN-PG 1339 Query: 181 GTCHHFA-NKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 C++ + N S+WL+T MP ++ + +SRC+VC Sbjct: 1340 DICYYASRNDKSYWLSTTAPLPM--MP----VEDVEIKPYISRCSVC 1380 >UniRef50_Q7PVR6 Cluster: ENSANGP00000016652; n=3; Endopterygota|Rep: ENSANGP00000016652 - Anopheles gambiae str. 
PEST Length = 461 Score = 122 bits (293), Expect = 8e-27 Identities = 54/106 (50%), Positives = 69/106 (65%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ++ IP CP GW ELW+GYS+ MHT SPGSC+E+FR P IEC+G Sbjct: 300 HSQSMSIPDCPEGWEELWLGYSYAMHTSDNSGGFGQDFVSPGSCMEEFRPQPVIECHGH- 358 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 GTC+ + SFWLT I+D+ QF P+ QTLK+ + +VSRC VC Sbjct: 359 GTCNFYDGISSFWLTIIDDAMQFNRPQPQTLKAHQ-TSKVSRCIVC 403 Score = 60.9 bits (141), Expect = 2e-08 Identities = 36/111 (32%), Positives = 54/111 (48%), Gaps = 1/111 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ + IP CP+ +LW GYS V + L + GSCL F +PF+ C+ Sbjct: 190 HSQKVTIPECPINTYKLWDGYSLV-NVIASSRSVGQDLGAAGSCLRRFSTMPFMFCD-IN 247 Query: 181 GTCHHFANK-LSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIKNT 330 C++ +N + WL T E P + + ++ +SRC+VC NT Sbjct: 248 NVCNYASNNDDTIWLATPEPMPMSMAP----IPADQVERYISRCSVCESNT 294 >UniRef50_Q9VMV5 Cluster: CG16858-PA; n=6; Schizophora|Rep: CG16858-PA - Drosophila melanogaster (Fruit fly) Length = 1940 Score = 120 bits (288), Expect = 3e-26 Identities = 53/106 (50%), Positives = 67/106 (63%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ++ IP CP GW E+W GYS+ M T L SPGSCLE+FRA P IEC+G Sbjct: 1631 HSQSMSIPDCPGGWEEMWTGYSYFMSTLDNVGGVGQNLVSPGSCLEEFRAQPVIECHGH- 1689 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 G C+++ SFWLT IE+ QF P +QTLK+ ++SRC VC Sbjct: 1690 GRCNYYDALASFWLTVIEEQDQFVQPRQQTLKAD-FTSKISRCTVC 1734 Score = 57.6 bits (133), Expect = 2e-07 Identities = 40/113 (35%), Positives = 54/113 (47%), Gaps = 3/113 (2%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ++ +P CP + LW GYS + L GSC+ F +P++ C+ Sbjct: 1521 HSQSVHVPQCPANTNLLWEGYS-LSGNVAASRAVGQDLGQSGSCMMRFTTMPYMLCD-IT 1578 Query: 181 GTCHHFA--NKLSFWLTTIEDSQQFAMPERQTLKSGR-LLERVSRCAVCIKNT 330 C HFA N S WL+T E MP T GR L++ +SRC VC T Sbjct: 1579 
NVC-HFAQNNDDSLWLSTAE-----PMPMTMTPIQGRDLMKYISRCVVCETTT 1625 >UniRef50_UPI000065E567 Cluster: Homolog of Brachydanio rerio "Collagen, type I, alpha 3.; n=1; Takifugu rubripes|Rep: Homolog of Brachydanio rerio "Collagen, type I, alpha 3. - Takifugu rubripes Length = 1426 Score = 116 bits (280), Expect = 3e-25 Identities = 51/109 (46%), Positives = 66/109 (60%), Gaps = 1/109 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ P CP GW LW GYSF+MHT L S GSCL++F+ P IEC G Sbjct: 1318 HSQEHTAPACPQGWRSLWTGYSFLMHTGAGDEGSGQALTSSGSCLKNFQTHPIIECQGPQ 1377 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKS-GRLLERVSRCAVCIK 324 G+CH+F+N SFWLTTI ++QF P T+K+ R + S+C VC++ Sbjct: 1378 GSCHYFSNLYSFWLTTISPTEQFKAPRPGTIKAPDRQRSKTSQCHVCLR 1426 Score = 59.7 bits (138), Expect = 5e-08 Identities = 37/107 (34%), Positives = 50/107 (46%), Gaps = 1/107 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ++ +P CP G S LW+GYS + + L GSCL F +PF CN Sbjct: 1211 HSQSVQVPKCPDGSSLLWVGYS-LAYLKGQKNAHAQDLGQAGSCLRVFSTMPFSYCN--K 1267 Query: 181 GTCHHFA-NKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 CH + N S+WL+T MP + + +SRC VC Sbjct: 1268 AACHFSSRNDKSYWLSTAAPIPM--MP----VFGQEISSHISRCVVC 1308 >UniRef50_UPI000065E566 Cluster: Homolog of Homo sapiens "Splice Isoform 1 of Collagen alpha 3; n=1; Takifugu rubripes|Rep: Homolog of Homo sapiens "Splice Isoform 1 of Collagen alpha 3 - Takifugu rubripes Length = 1258 Score = 115 bits (277), Expect = 7e-25 Identities = 51/107 (47%), Positives = 64/107 (59%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT IP CP W LW GYSFVM T L SPGSCLE FR +PFIEC+G Sbjct: 1153 HSQTTQIPDCPQDWVSLWSGYSFVMQTGIGADGSSQPLISPGSCLETFRKVPFIECHGR- 1211 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCI 321 GTC+++ + SFW+ +++ F P QT+K L +SRC VC+ Sbjct: 1212 GTCNYYPDSYSFWMASLDPKNMFGKPIPQTVKEPSLQSILSRCRVCM 1258 Score = 73.3 bits (172), Expect = 4e-12 Identities = 43/108 (39%), Positives 
= 59/108 (54%), Gaps = 2/108 (1%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFV-MHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGE 177 HSQ++ IP CP G S L+ GYSF+ MH L +PGSCL F +PF+ C+ E Sbjct: 1043 HSQSIHIPVCPCGTSLLFSGYSFLFMHANDRVHGQD--LGTPGSCLPHFSTMPFLVCDTE 1100 Query: 178 GGTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLL-ERVSRCAVC 318 + N S+WL+T + A+PE +G +L +SRCAVC Sbjct: 1101 SNCRYASRNDYSYWLSTGK-----ALPENMVSITGDMLASYISRCAVC 1143 >UniRef50_UPI00006608B5 Cluster: Homolog of Homo sapiens "Tumstatin; n=1; Takifugu rubripes|Rep: Homolog of Homo sapiens "Tumstatin - Takifugu rubripes Length = 1374 Score = 114 bits (275), Expect = 1e-24 Identities = 51/108 (47%), Positives = 65/108 (60%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQT +P CP+GW LW+GYSFVM T LASPGSCLE FR IPFIEC+G Sbjct: 1267 HSQTSVVPDCPLGWLPLWVGYSFVMETGVGAEGSGQPLASPGSCLEQFRKIPFIECHGR- 1325 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIK 324 GTC+++ + S+WL + F+ P+ T +SRC VC+K Sbjct: 1326 GTCNYYTDSYSYWLAALSPHDMFSKPKPHTDTGEFPGSLISRCRVCMK 1373 Score = 66.9 bits (156), Expect = 3e-10 Identities = 39/111 (35%), Positives = 56/111 (50%), Gaps = 1/111 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ L IP CPVG +E++ GYS ++ L + GSCL F +PF+ CN + Sbjct: 1157 HSQELSIPECPVGSTEVYSGYS-LLFINGNNRAHGQDLGTLGSCLPRFTTMPFLFCNTD- 1214 Query: 181 GTCHHFA-NKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIKNT 330 TC + + N S+WL+T Q + + L +SRC+VC T Sbjct: 1215 STCRYASRNDYSYWLST----NQVVLSNMPLISGDLLRSYISRCSVCETRT 1261 >UniRef50_Q9GQB1 Cluster: Type IV collagen alpha 1 chain precursor; n=1; Hydra vulgaris|Rep: Type IV collagen alpha 1 chain precursor - Hydra attenuata (Hydra) (Hydra vulgaris) Length = 1723 Score = 111 bits (268), Expect = 9e-24 Identities = 53/109 (48%), Positives = 70/109 (64%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ+ P CP GW LW G+SF+M+ L+S GSCLEDFR P+IEC+G Sbjct: 1615 
HSQSELDPKCPDGWENLWTGFSFLMYNSAGAQGSGQLLSSSGSCLEDFRVNPYIECHGR- 1673 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIKN 327 GTC ++ LSFWL+TI +S F +P+ + L+ L RVSRCAVC+K+ Sbjct: 1674 GTCWYYGPTLSFWLSTIGESNMFQVPKFEILER-NLKARVSRCAVCMKS 1721 Score = 69.7 bits (163), Expect = 5e-11 Identities = 36/106 (33%), Positives = 50/106 (47%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ++ +P CP G +W GYSF ++ L PGSCL+ F +PF+ C+ + Sbjct: 1507 HSQSIKVPSCPAGMQTMWEGYSF-LYAQGNERAFGQDLGQPGSCLKRFSTMPFLFCDIQN 1565 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 N SFWL+T E P+ L +SRC VC Sbjct: 1566 KCVVASRNDYSFWLSTAE------KPKEAPSSGADLENYISRCIVC 1605 >UniRef50_UPI00015A592A Cluster: Type IV collagen alpha 4 chain; n=3; Danio rerio|Rep: Type IV collagen alpha 4 chain - Danio rerio Length = 1639 Score = 94.7 bits (225), Expect = 1e-18 Identities = 45/110 (40%), Positives = 61/110 (55%), Gaps = 1/110 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ P CP W LW G+SF+M+T L S GSCL+DFR+ PF+EC G Sbjct: 1530 HSQDRLDPVCPPKWRSLWTGFSFMMYTGSGDEGGGQSLTSTGSCLQDFRSQPFVECQGPR 1589 Query: 181 GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGR-LLERVSRCAVCIKN 327 GTC +FA+ SFW+ TI+ + P + R + SRC++C+ N Sbjct: 1590 GTCSYFASIYSFWM-TIDMEHNDSSPHGPVITEERQQRDSTSRCSICMMN 1638 Score = 62.5 bits (145), Expect = 7e-09 Identities = 38/107 (35%), Positives = 55/107 (51%), Gaps = 1/107 (0%) Frame = +1 Query: 1 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180 HSQ+ +P CP G ++LW GYS +++ L GSCL F +PF CN + Sbjct: 1423 HSQSRYVPTCPAGLTQLWNGYS-LLYLEGQERAHTQDLGQAGSCLPVFSTMPFSCCNMD- 1480 Query: 181 GTCHHFA-NKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318 TC + + N S+WL+T +P + LK + E +SRC VC Sbjct: 1481 -TCDYASRNDKSYWLST-----NAPIPNK-PLKGQDIEEHISRCVVC 1520 >UniRef50_Q4TZW9 Cluster: Type IV collagen alpha 4 chain; n=3; Danio rerio|Rep: Type IV collagen alpha 4 chain - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 240 Score = 94.7 bits 
(225), Expect = 1e-18
 Identities = 45/110 (40%), Positives = 61/110 (55%), Gaps = 1/110 (0%)
 Frame = +1

Query: 1    HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180
            HSQ P CP W LW G+SF+M+T L S GSCL+DFR+ PF+EC G
Sbjct: 128  HSQDRLDPVCPPKWRNLWTGFSFMMYTGSGDEGGGQSLTSTGSCLQDFRSQPFVECQGPR 187

Query: 181  GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGR-LLERVSRCAVCIKN 327
            GTC +FA+ SFW+ TI+ + P + R + SRC++C+ N
Sbjct: 188  GTCSYFASIYSFWM-TIDMEHNDSSPHGPVITEERQQRDSTSRCSICMMN 236

 Score = 62.5 bits (145), Expect = 7e-09
 Identities = 38/107 (35%), Positives = 55/107 (51%), Gaps = 1/107 (0%)
 Frame = +1

Query: 1    HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180
            HSQ+ +P CP G ++LW GYS +++ L GSCL F +PF CN +
Sbjct: 21   HSQSRYVPTCPAGLTQLWNGYS-LLYLEGQERAHTQDLGQAGSCLPVFSTMPFSCCNMD- 78

Query: 181  GTCHHFA-NKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318
            TC + + N S+WL+T +P + LK + E +SRC VC
Sbjct: 79   -TCDYASRNDKSYWLST-----NAPIPNK-PLKGQDIEEHISRCVVC 118

>UniRef50_Q5C3P1 Cluster: SJCHGC06113 protein; n=2; Platyhelminthes|Rep: SJCHGC06113 protein - Schistosoma japonicum (Blood fluke)
          Length = 587

 Score = 86.6 bits (205), Expect = 4e-16
 Identities = 43/111 (38%), Positives = 62/111 (55%), Gaps = 3/111 (2%)
 Frame = +1

Query: 1    HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180
            HSQ + CP W+ELW G S ++HT L+SPGSC+E FR P IECN
Sbjct: 475  HSQGETLQPCPSTWTELWTGVSLILHTSGAHGGGQQ-LSSPGSCMEHFRYSPVIECNNNV 533

Query: 181  GTCHHFANKLSFWLTTIEDS-QQFAMPERQTLKS--GRLLERVSRCAVCIK 324
            G CH++++ ++L + + QF P +K+ G +L VS+C VC+K
Sbjct: 534  GMCHYWSDAKVYYLRALNPNITQFEKPVGFVMKAAEGPVLNNVSKCRVCMK 584

 Score = 53.2 bits (122), Expect = 4e-06
 Identities = 28/97 (28%), Positives = 47/97 (48%)
 Frame = +1

Query: 28   CPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEGGTCHHFANK 207
            CP G ++L+ GYS+VM L +P SCL F ++P +C + ++
Sbjct: 376  CPGGTNKLFTGYSYVMG-GGVDDLVSMDLGTPSSCLSKFSSLPMTQCERDTTCQSSMRHE 434

Query: 208  LSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318
            S+WL T+ + +P QT ++++RC VC
Sbjct: 435  RSYWLATLVPRSEQPIPVNQT------ADQIARCVVC 465

>UniRef50_O09238 Cluster: Collagen type IV; n=2; Pseudocorticium jarrei|Rep: Collagen type IV - Pseudocorticium jarrei
          Length = 854

 Score = 84.2 bits (199), Expect = 2e-15
 Identities = 40/106 (37%), Positives = 55/106 (51%)
 Frame = +1

Query: 1    HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180
            HSQ ++P C GW LW G+SF+ T L SPGSCL+ FR+ PFI C G
Sbjct: 743  HSQDSNVPDCFPGWVTLWTGFSFLQQTAAQAEGTGQGLESPGSCLQHFRSTPFIGCGGR- 801

Query: 181  GTCHHFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318
            G C + + S+W+ ++ F E T + +R+SRC VC
Sbjct: 802  GQCSYDSVSGSYWMIVLDALNPFQDTEPGTYPVSDIEKRLSRCRVC 847

 Score = 63.7 bits (148), Expect = 3e-09
 Identities = 37/111 (33%), Positives = 56/111 (50%), Gaps = 1/111 (0%)
 Frame = +1

Query: 1    HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180
            HSQT +IP CP ++ LW+GYS + T L PGSC+ F +P + CN
Sbjct: 635  HSQTTNIPQCPNDYTRLWVGYSLLQLT-GNGLGVGQDLGDPGSCMPSFHPMPVVRCN-PM 692

Query: 181  GTCHHFANK-LSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIKNT 330
            C K S+WL+T ++ + +P + + E +SRC+VC N+
Sbjct: 693  QRCEFARRKDESYWLST--NATRPPIP----VSGSDIEEHISRCSVCESNS 737

>UniRef50_UPI0000DA44CD Cluster: PREDICTED: similar to procollagen, type IV, alpha 6; n=1; Rattus norvegicus|Rep: PREDICTED: similar to procollagen, type IV, alpha 6 - Rattus norvegicus
          Length = 1405

 Score = 68.1 bits (159), Expect = 1e-10
 Identities = 39/107 (36%), Positives = 56/107 (52%), Gaps = 1/107 (0%)
 Frame = +1

Query: 1    HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180
            HSQ+ +P CP+G S+LW+GYS ++ L GSCL F +PFI CN
Sbjct: 1254 HSQSEHVPPCPIGMSQLWVGYS-LLFVEGQEKAHNQDLGFAGSCLPRFSTMPFIYCN-IN 1311

Query: 181  GTCHHF-ANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318
            CH+ N S+WL+T MP +T ++ + +SRC+VC
Sbjct: 1312 EVCHYARRNDKSYWLSTTAPIPM--MPVGET----QIPQYISRCSVC 1352

>UniRef50_UPI0000DBF028 Cluster: UPI0000DBF028 related cluster; n=9; Rattus norvegicus|Rep: UPI0000DBF028 UniRef100 entry - Rattus norvegicus
          Length = 1549

 Score = 68.1 bits (159), Expect = 1e-10
 Identities = 39/107 (36%), Positives = 56/107 (52%), Gaps = 1/107 (0%)
 Frame = +1

Query: 1    HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180
            HSQ+ +P CP+G S+LW+GYS ++ L GSCL F +PFI CN
Sbjct: 1418 HSQSEHVPPCPIGMSQLWVGYS-LLFVEGQEKAHNQDLGFAGSCLPRFSTMPFIYCN-IN 1475

Query: 181  GTCHHF-ANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVC 318
            CH+ N S+WL+T MP +T ++ + +SRC+VC
Sbjct: 1476 EVCHYARRNDKSYWLSTTAPIPM--MPVGET----QIPQYISRCSVC 1516

 Score = 41.5 bits (93), Expect = 0.014
 Identities = 16/25 (64%), Positives = 19/25 (76%)
 Frame = +1

Query: 1    HSQTLDIPGCPVGWSELWIGYSFVM 75
            HSQ +P CP+GW LWIGYSF+M
Sbjct: 1526 HSQDT-VPQCPLGWHSLWIGYSFLM 1549

>UniRef50_Q5BYE6 Cluster: SJCHGC08138 protein; n=1; Schistosoma japonicum|Rep: SJCHGC08138 protein - Schistosoma japonicum (Blood fluke)
          Length = 206

 Score = 57.2 bits (132), Expect = 3e-07
 Identities = 37/112 (33%), Positives = 50/112 (44%), Gaps = 2/112 (1%)
 Frame = +1

Query: 1    HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGE- 177
            HSQ P CP+ + ++ GYS V L +PGSCL F +PF C +
Sbjct: 99   HSQDSQPPSCPIYTTPVYTGYSLVT-LQGDDDSTTMDLGTPGSCLRKFSIMPFANCFAKV 157

Query: 178  GGTCH-HFANKLSFWLTTIEDSQQFAMPERQTLKSGRLLERVSRCAVCIKNT 330
            G C + N S+WL+T+E Q P R + +SRC VC T
Sbjct: 158  NGNCQINMRNGRSYWLSTLE--QYMLSPARVE----NIKPYISRCIVCQSRT 203

>UniRef50_Q4SB07 Cluster: Chromosome undetermined SCAF14677, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF14677, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer)
          Length = 856

 Score = 54.0 bits (124), Expect = 2e-06
 Identities = 29/77 (37%), Positives = 43/77 (55%), Gaps = 1/77 (1%)
 Frame = +1

Query: 1    HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECNGEG 180
            HSQ L IP CP G ++++ GYS ++ L + GSCL F +PF+ CN +
Sbjct: 741  HSQELYIPECPAGSTQVYSGYS-LLFINGNNRAHGQDLGTLGSCLPRFTTMPFLFCNTD- 798

Query: 181  GTCHHFA-NKLSFWLTT 228
            TC + + N S+WL+T
Sbjct: 799  RTCRYASRNDYSYWLST 815

>UniRef50_A7T795 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis
          Length = 331

 Score = 46.8 bits (106), Expect = 4e-04
 Identities = 21/57 (36%), Positives = 30/57 (52%)
 Frame = +1

Query: 1    HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECN 171
            HSQT P CP + +LW GYS +++ L GSCL+ F +P++ CN
Sbjct: 262  HSQTTTPPECPPTYDKLWDGYS-LLYVQGHDVSHGQDLGQAGSCLKRFTTMPYLYCN 317

>UniRef50_A7T3G2 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis
          Length = 590

 Score = 46.8 bits (106), Expect = 4e-04
 Identities = 21/57 (36%), Positives = 30/57 (52%)
 Frame = +1

Query: 1    HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLEDFRAIPFIECN 171
            HSQT P CP + +LW GYS +++ L GSCL+ F +P++ CN
Sbjct: 481  HSQTTTPPECPPTYDKLWDGYS-LLYVQGHDVSHGQDLGQAGSCLKRFTTMPYLYCN 536

>UniRef50_A1ZSZ5 Cluster: Putative uncharacterized protein; n=1; Microscilla marina ATCC 23134|Rep: Putative uncharacterized protein - Microscilla marina ATCC 23134
          Length = 256

 Score = 33.9 bits (74), Expect = 2.8
 Identities = 22/53 (41%), Positives = 30/53 (56%), Gaps = 2/53 (3%)
 Frame = -3

Query: 311  AHLDTRSRRRPDLSVCRSGIANCWL--SSMVVSQKLSLFAK*WQVPPSPLHSI 159
            A L + RRR L R+G+ WL + +V+Q L L W+ PP+PLHSI
Sbjct: 26   ALLTKKPRRRRQL--LRTGVFLLWLFTNPFIVNQALLL----WEAPPTPLHSI 72

>UniRef50_Q24BC9 Cluster: Putative uncharacterized protein; n=1; Tetrahymena thermophila SB210|Rep: Putative uncharacterized protein - Tetrahymena thermophila SB210
          Length = 474

 Score = 33.1 bits (72), Expect = 5.0
 Identities = 31/103 (30%), Positives = 52/103 (50%), Gaps = 3/103 (2%)
 Frame = +2

Query: 185  LAITLQINSVSG*PPSKTANSLL-CQSDRHLSLDVS*NVYPDVQFVSKTPHSANKLKVLN 361
            L I+L +N ++ S + L C HLSL + N D+ VS+ S ++ + L+
Sbjct: 112  LVISLNMNKITSIGSSNLISQLTKCNKLSHLSLFFNNNQIGDLG-VSQFSSSLSQFQKLS 170

Query: 362  NQMLI*PLMQYLFKIQKK-LIDHICNTLFKFLVIN-YSQETTL 484
            + Q+ +IQ+ L+ IC+ +FK+L+ N YS T L
Sbjct: 171  KLLFGLKEHQHQVQIQQNALLFQICDFIFKYLLQNLYSINTKL 213

  Database: uniref50
    Posted date:  Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database:  1,657,284

Lambda     K      H
   0.318    0.134    0.401
Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 568,007,212
Number of Sequences: 1657284
Number of extensions: 10637436
Number of successful extensions: 19812
Number of sequences better than 10.0: 35
Number of HSP's better than 10.0 without gapping: 19226
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 19744
length of database: 575,637,011
effective HSP length: 96
effective length of database: 416,537,747
effective search space used: 40820699206
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
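
Note on the statistics footer: the figures above can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST output. It is written in Python, all function and constant names are hypothetical, and the constants are copied from this report. It assumes the standard Karlin-Altschul relations S' = (lambda * S - ln K) / ln 2 and E = K * N * exp(-lambda * S), with N taken to be the "effective search space used". Because the printed lambda and K are rounded, recomputed E-values should be expected to agree with the report only approximately.

```python
import math

# Constants copied from the statistics footer of this report.
DB_LETTERS = 575_637_011
DB_SEQUENCES = 1_657_284
EFFECTIVE_HSP_LENGTH = 96
EFFECTIVE_DB_LENGTH = 416_537_747
EFFECTIVE_SEARCH_SPACE = 40_820_699_206
GAPPED_LAMBDA = 0.279   # rounded as printed
GAPPED_K = 0.0580       # rounded as printed


def effective_db_length(letters, sequences, hsp_length):
    """Database length after subtracting one effective HSP length per sequence."""
    return letters - sequences * hsp_length


def bit_score(raw_score, lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Normalized score in bits: S' = (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(raw_score, search_space=EFFECTIVE_SEARCH_SPACE,
                 lam=GAPPED_LAMBDA, k=GAPPED_K):
    """Approximate E-value: E = K * N * exp(-lambda * S)."""
    return k * search_space * math.exp(-lam * raw_score)


if __name__ == "__main__":
    # Effective database length reported in the footer.
    print(effective_db_length(DB_LETTERS, DB_SEQUENCES, EFFECTIVE_HSP_LENGTH))
    # Effective query length implied by the search space.
    print(EFFECTIVE_SEARCH_SPACE // EFFECTIVE_DB_LENGTH)
    # Raw score 205 is the first HSP of UniRef50_Q5C3P1 (reported as 86.6 bits).
    print(round(bit_score(205), 1), expect_value(205))
```

With these inputs, 575,637,011 - 1,657,284 x 96 reproduces the reported effective database length of 416,537,747, and dividing the effective search space by that length gives an effective query length of 98 residues.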