BLASTX 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= I10A02NGRL0003_M17
(530 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
UniRef50_UPI0000DB7985 Cluster: PREDICTED: similar to Collagen t... 248 6e-65
UniRef50_P08120 Cluster: Collagen alpha-1(IV) chain precursor; n... 244 1e-63
UniRef50_Q4S0I4 Cluster: Chromosome 2 SCAF14781, whole genome sh... 239 2e-62
UniRef50_P08572 Cluster: Collagen alpha-2(IV) chain precursor [C... 231 8e-60
UniRef50_UPI0000613E3C Cluster: Collagen alpha-2(IV) chain; n=2;... 229 3e-59
UniRef50_Q4SZ73 Cluster: Chromosome undetermined SCAF11805, whol... 228 7e-59
UniRef50_P29400 Cluster: Collagen alpha-5(IV) chain precursor; n... 227 9e-59
UniRef50_Q07265 Cluster: 3 alpha procollagen; n=4; Strongylocent... 221 8e-57
UniRef50_P55787 Cluster: Collagen alpha-4(IV) chain; n=46; Eumet... 220 1e-56
UniRef50_P53420 Cluster: Collagen alpha-4(IV) chain precursor; n... 217 1e-55
UniRef50_P17139-2 Cluster: Isoform b of P17139 ; n=2; Caenorhabd... 215 7e-55
UniRef50_Q4SZ69 Cluster: Chromosome undetermined SCAF11805, whol... 208 1e-53
UniRef50_Q28084 Cluster: Collagen alpha-3(IV) chain; n=13; cellu... 210 2e-53
UniRef50_UPI0000DBF028 Cluster: UPI0000DBF028 related cluster; n... 209 3e-53
UniRef50_Q26640 Cluster: Alpha2(IV)-like collagen; n=4; Strongyl... 209 3e-53
UniRef50_Q01955 Cluster: Collagen alpha-3(IV) chain precursor (G... 207 1e-52
UniRef50_Q7PVR6 Cluster: ENSANGP00000016652; n=3; Endopterygota|... 202 4e-51
UniRef50_UPI00015B49AB Cluster: PREDICTED: similar to alpha-5 ty... 200 1e-50
UniRef50_UPI00015A592A Cluster: Type IV collagen alpha 4 chain; ... 200 2e-50
UniRef50_Q4TZW9 Cluster: Type IV collagen alpha 4 chain; n=3; Da... 200 2e-50
UniRef50_Q58FS7 Cluster: Type IV collagen alpha 3 chain; n=2; Da... 199 3e-50
UniRef50_UPI00006608B5 Cluster: Homolog of Homo sapiens "Tumstat... 198 9e-50
UniRef50_UPI000065E567 Cluster: Homolog of Brachydanio rerio "Co... 195 6e-49
UniRef50_Q9GQB1 Cluster: Type IV collagen alpha 1 chain precurso... 189 3e-47
UniRef50_UPI000065E566 Cluster: Homolog of Homo sapiens "Splice ... 188 7e-47
UniRef50_Q9VMV5 Cluster: CG16858-PA; n=6; Schizophora|Rep: CG168... 186 4e-46
UniRef50_UPI0000DA44CD Cluster: PREDICTED: similar to procollage... 179 3e-44
UniRef50_O09238 Cluster: Collagen type IV; n=2; Pseudocorticium ... 159 4e-38
UniRef50_Q5C3P1 Cluster: SJCHGC06113 protein; n=2; Platyhelminth... 131 1e-29
UniRef50_Q4SB07 Cluster: Chromosome undetermined SCAF14677, whol... 118 8e-26
UniRef50_A7T3G2 Cluster: Predicted protein; n=1; Nematostella ve... 109 5e-23
UniRef50_A7T795 Cluster: Predicted protein; n=1; Nematostella ve... 108 9e-23
UniRef50_Q5BYE6 Cluster: SJCHGC08138 protein; n=1; Schistosoma j... 93 3e-18
UniRef50_Q4WVM5 Cluster: DNA polymerase gamma; n=13; Pezizomycot... 33 4.1
UniRef50_A4RDD1 Cluster: Putative uncharacterized protein; n=1; ... 33 5.4
UniRef50_Q4T5R1 Cluster: Chromosome undetermined SCAF9151, whole... 32 9.5
UniRef50_A1G1L8 Cluster: Glycoside hydrolase family 2, immunoglo... 32 9.5
UniRef50_A0CNR1 Cluster: Chromosome undetermined scaffold_22, wh... 32 9.5
>UniRef50_UPI0000DB7985 Cluster: PREDICTED: similar to Collagen type
IV CG4145-PA, isoform A isoform 1; n=1; Apis
mellifera|Rep: PREDICTED: similar to Collagen type IV
CG4145-PA, isoform A isoform 1 - Apis mellifera
Length = 1913
Score = 248 bits (607), Expect = 6e-65
Identities = 115/188 (61%), Positives = 140/188 (74%), Gaps = 24/188 (12%)
Frame = +2
Query: 38 DYLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLG------------ 181
DYLTGILLV+HSQ +++P C+ GH+KLW+GYSLL+ DG+E+AH+QDLG
Sbjct: 1662 DYLTGILLVKHSQSQLLPVCDAGHIKLWEGYSLLFTDGDERAHSQDLGKSETYIAIDSKF 1721
Query: 182 ------------YAGSCVRKFSTMPFLFCDLNDVCNYASRNDRSYWLSTGQPIPMMPVEG 325
YAGSCVRKFSTMPFLFCD+N+VC+Y +R DRSYWLST PIPMMPV+
Sbjct: 1722 FPRFSYDLVPFRYAGSCVRKFSTMPFLFCDINNVCHYGNRGDRSYWLSTTSPIPMMPVQE 1781
Query: 326 NEIVKYISRCVVCEVPSNVIAVHSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXL 505
+EI +YISRCVVCEVP+NV+AVHSQ+L+IP CP GW+ LWIGYSF+MHT L
Sbjct: 1782 SEIEQYISRCVVCEVPANVLAVHSQSLNIPDCPQGWTGLWIGYSFLMHTGAGAQGGGQSL 1841
Query: 506 ASPGSCLE 529
+S GSCLE
Sbjct: 1842 SSSGSCLE 1849
Score = 58.4 bits (135), Expect = 1e-07
Identities = 36/111 (32%), Positives = 53/111 (47%), Gaps = 8/111 (7%)
Frame = +2
Query: 56 LLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFC 232
+L HSQ +P C G LW GYS L++ + Q L +GSC+ F PF+ C
Sbjct: 1800 VLAVHSQSLNIPDCPQGWTGLWIGYSFLMHTGAGAQGGGQSLSSSGSCLEDFRATPFIEC 1859
Query: 233 DLN-DVCNYASRNDRSYWLST------GQPIPMMPVEGNEIVKYISRCVVC 364
+ N C+Y N+ S+W++T Q ++ + ISRC VC
Sbjct: 1860 NGNKGQCHY-YMNEISFWMATIEDRQQFQAPEQQTLKAGNLRSKISRCQVC 1909
>UniRef50_P08120 Cluster: Collagen alpha-1(IV) chain precursor; n=5;
Diptera|Rep: Collagen alpha-1(IV) chain precursor -
Drosophila melanogaster (Fruit fly)
Length = 1775
Score = 244 bits (597), Expect = 1e-63
Identities = 108/164 (65%), Positives = 125/164 (76%)
Frame = +2
Query: 38 DYLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTM 217
DYLTGIL+ RHSQ E VP C GH +LW GYSLLY+DGN+ AHNQDLG SCV +FST+
Sbjct: 1550 DYLTGILITRHSQSETVPACSAGHTELWTGYSLLYVDGNDYAHNQDLG---SCVPRFSTL 1606
Query: 218 PFLFCDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHS 397
P L C N+VCNYASRND+++WL+T IPMMPVE EI +YISRCVVCE P+NVIAVHS
Sbjct: 1607 PVLSCGQNNVCNYASRNDKTFWLTTNAAIPMMPVENIEIRQYISRCVVCEAPANVIAVHS 1666
Query: 398 QTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
QT+++P CP GW LWIGYSF+MHT L SPGSCLE
Sbjct: 1667 QTIEVPDCPNGWEGLWIGYSFLMHTAVGNGGGGQALQSPGSCLE 1710
Score = 50.4 bits (115), Expect = 3e-05
Identities = 34/107 (31%), Positives = 47/107 (43%), Gaps = 8/107 (7%)
Frame = +2
Query: 68 HSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD-LN 241
HSQ VP C G LW GYS L++ Q L GSC+ F PF+ C+
Sbjct: 1665 HSQTIEVPDCPNGWEGLWIGYSFLMHTAVGNGGGGQALQSPGSCLEDFRATPFIECNGAK 1724
Query: 242 DVCNYASRNDRSYW---LSTGQPI---PMMPVEGNEIVKYISRCVVC 364
C++ S+W L + QP ++ E ++SRC VC
Sbjct: 1725 GTCHF-YETMTSFWMYNLESSQPFERPQQQTIKAGERQSHVSRCQVC 1770
>UniRef50_Q4S0I4 Cluster: Chromosome 2 SCAF14781, whole genome shotgun
sequence; n=5; Euteleostomi|Rep: Chromosome 2 SCAF14781,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 1468
Score = 239 bits (586), Expect = 2e-62
Identities = 106/160 (66%), Positives = 120/160 (75%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G LLV+HSQ E +P C G KLW GYSLLY++G EKAHNQDLG AGSC+ +FSTMPFL+
Sbjct: 1276 GYLLVKHSQTEQIPMCPVGMAKLWSGYSLLYMEGQEKAHNQDLGLAGSCLPRFSTMPFLY 1335
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQTLD 409
C+ D+C YASRND+SYWLST P+PMMPVE EI YISRC VCE PS IAVHSQ +
Sbjct: 1336 CNPGDICYYASRNDKSYWLSTTAPLPMMPVEDVEIKPYISRCSVCEAPSVAIAVHSQDIT 1395
Query: 410 IPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
IP CPVGW LWIGYSF+MHT L+SPGSCLE
Sbjct: 1396 IPQCPVGWRSLWIGYSFLMHTAAGNEGGGQSLSSPGSCLE 1435
Score = 59.3 bits (137), Expect = 5e-08
Identities = 30/82 (36%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
Frame = +2
Query: 53 ILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
+ + HSQ +PQC G LW GYS L++ + Q L GSC+ F T PF+
Sbjct: 1385 VAIAVHSQDITIPQCPVGWRSLWIGYSFLMHTAAGNEGGGQSLSSPGSCLEDFRTTPFIE 1444
Query: 230 CD-LNDVCNYASRNDRSYWLST 292
C+ C+Y + N S+WLS+
Sbjct: 1445 CNGAKGTCHYFA-NKHSFWLSS 1465
>UniRef50_P08572 Cluster: Collagen alpha-2(IV) chain precursor
[Contains: Canstatin]; n=48; Tetrapoda|Rep: Collagen
alpha-2(IV) chain precursor [Contains: Canstatin] - Homo
sapiens (Human)
Length = 1712
Score = 231 bits (565), Expect = 8e-60
Identities = 103/160 (64%), Positives = 116/160 (72%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G LLV+HSQ + P C G KLW GYSLLY +G EKAHNQDLG AGSC+ +FSTMPFL+
Sbjct: 1489 GYLLVKHSQTDQEPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLY 1548
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQTLD 409
C+ DVC YASRND+SYWLST P+PMMPV +EI YISRC VCE P+ IAVHSQ +
Sbjct: 1549 CNPGDVCYYASRNDKSYWLSTTAPLPMMPVAEDEIKPYISRCSVCEAPAIAIAVHSQDVS 1608
Query: 410 IPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
IP CP GW LWIGYSF+MHT L SPGSCLE
Sbjct: 1609 IPHCPAGWRSLWIGYSFLMHTAAGDEGGGQSLVSPGSCLE 1648
Score = 56.8 bits (131), Expect = 3e-07
Identities = 38/112 (33%), Positives = 54/112 (48%), Gaps = 8/112 (7%)
Frame = +2
Query: 53 ILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
I + HSQ +P C G LW GYS L++ ++ Q L GSC+ F PF+
Sbjct: 1598 IAIAVHSQDVSIPHCPAGWRSLWIGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIE 1657
Query: 230 CD-LNDVCNYASRNDRSYWLST--GQPIPMMP----VEGNEIVKYISRCVVC 364
C+ C+Y + N S+WL+T Q P ++ I +ISRC VC
Sbjct: 1658 CNGGRGTCHYYA-NKYSFWLTTIPEQSFQGSPSADTLKAGLIRTHISRCQVC 1708
>UniRef50_UPI0000613E3C Cluster: Collagen alpha-2(IV) chain; n=2;
Bos taurus|Rep: Collagen alpha-2(IV) chain - Bos Taurus
Length = 227
Score = 229 bits (560), Expect = 3e-59
Identities = 102/160 (63%), Positives = 115/160 (71%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G LLV+HSQ + P C G KLW GYSLLY +G EKAHNQDLG AGSC+ +FSTMPFL+
Sbjct: 4 GYLLVKHSQTDKEPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLY 63
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQTLD 409
C+ DVC YASRND+SYWLST P+PMMPV +I YISRC VCE P+ IAVHSQ +
Sbjct: 64 CNPGDVCYYASRNDKSYWLSTTAPLPMMPVAEEDIRPYISRCSVCEAPAVAIAVHSQDVS 123
Query: 410 IPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
IP CP GW LWIGYSF+MHT L SPGSCLE
Sbjct: 124 IPHCPAGWRSLWIGYSFLMHTAAGDEGGGQSLVSPGSCLE 163
Score = 57.6 bits (133), Expect = 2e-07
Identities = 37/112 (33%), Positives = 54/112 (48%), Gaps = 8/112 (7%)
Frame = +2
Query: 53 ILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
+ + HSQ +P C G LW GYS L++ ++ Q L GSC+ F PF+
Sbjct: 113 VAIAVHSQDVSIPHCPAGWRSLWIGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIE 172
Query: 230 CD-LNDVCNYASRNDRSYWLST--GQPIPMMP----VEGNEIVKYISRCVVC 364
C+ C+Y + N S+WL+T Q P ++ I +ISRC VC
Sbjct: 173 CNGARGTCHYYA-NKYSFWLTTIPEQSFQGTPSADTLKAGLIRTHISRCQVC 223
>UniRef50_Q4SZ73 Cluster: Chromosome undetermined SCAF11805, whole
genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome undetermined SCAF11805, whole genome shotgun
sequence - Tetraodon nigroviridis (Green puffer)
Length = 1026
Score = 228 bits (557), Expect = 7e-59
Identities = 101/160 (63%), Positives = 114/160 (71%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G LV+HSQ VP C G KLWDGYSLLY++G EKAHNQDLG GSC+ +FST+PFL+
Sbjct: 804 GYTLVKHSQDAQVPMCPQGMAKLWDGYSLLYVEGQEKAHNQDLGQPGSCLPRFSTIPFLY 863
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQTLD 409
C N+VC YASRND+SYWLST IPMMPV +I YISRC VCE PS +AVHSQ +
Sbjct: 864 CSPNEVCYYASRNDKSYWLSTTASIPMMPVAEAQIQAYISRCSVCEAPSQAVAVHSQDMT 923
Query: 410 IPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
IP CP GW LWIGYSF+MHT L SPGSCLE
Sbjct: 924 IPTCPPGWRSLWIGYSFLMHTAAGAEGGGQSLVSPGSCLE 963
Score = 60.1 bits (139), Expect = 3e-08
Identities = 37/108 (34%), Positives = 52/108 (48%), Gaps = 9/108 (8%)
Frame = +2
Query: 68 HSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD-LN 241
HSQ +P C PG LW GYS L++ + Q L GSC+ F PF+ C+
Sbjct: 918 HSQDMTIPTCPPGWRSLWIGYSFLMHTAAGAEGGGQSLVSPGSCLEDFRATPFIECNGAK 977
Query: 242 DVCNYASRNDRSYWLSTGQP------IPMM-PVEGNEIVKYISRCVVC 364
C+Y + N S+WL+T P P ++G + +SRC VC
Sbjct: 978 GTCHYFA-NKYSFWLTTVDPNQEFFYSPSQETLKGGQERSKVSRCQVC 1024
>UniRef50_P29400 Cluster: Collagen alpha-5(IV) chain precursor; n=61;
Eumetazoa|Rep: Collagen alpha-5(IV) chain precursor -
Homo sapiens (Human)
Length = 1685
Score = 227 bits (556), Expect = 9e-59
Identities = 98/162 (60%), Positives = 123/162 (75%), Gaps = 2/162 (1%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G L+ RHSQ PQC G +++++G+SLLY+ GN++AH QDLG AGSC+R+FSTMPF+F
Sbjct: 1461 GFLITRHSQTTDAPQCPQGTLQVYEGFSLLYVQGNKRAHGQDLGTAGSCLRRFSTMPFMF 1520
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIP--MMPVEGNEIVKYISRCVVCEVPSNVIAVHSQT 403
C++N+VCN+ASRND SYWLST +P+P M P++G I +ISRC VCE P+ VIAVHSQT
Sbjct: 1521 CNINNVCNFASRNDYSYWLSTPEPMPMSMQPLKGQSIQPFISRCAVCEAPAVVIAVHSQT 1580
Query: 404 LDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
+ IP CP GW LWIGYSF+MHT LASPGSCLE
Sbjct: 1581 IQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSCLE 1622
Score = 65.7 bits (153), Expect = 6e-10
Identities = 37/111 (33%), Positives = 54/111 (48%), Gaps = 7/111 (6%)
Frame = +2
Query: 53 ILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
+++ HSQ +P C G LW GYS +++ + Q L GSC+ +F + PF+
Sbjct: 1572 VVIAVHSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSCLEEFRSAPFIE 1631
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVK------YISRCVVC 364
C CNY + N S+WL+T M +E +K ISRC VC
Sbjct: 1632 CHGRGTCNYYA-NSYSFWLATVDVSDMFSKPQSETLKAGDLRTRISRCQVC 1681
>UniRef50_Q07265 Cluster: 3 alpha procollagen; n=4; Strongylocentrotus
purpuratus|Rep: 3 alpha procollagen - Strongylocentrotus
purpuratus (Purple sea urchin)
Length = 1752
Score = 221 bits (540), Expect = 8e-57
Identities = 93/163 (57%), Positives = 119/163 (73%), Gaps = 2/163 (1%)
Frame = +2
Query: 47 TGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFL 226
+G + RHSQ +PQC G K+W GYSLL++ GNE+ H QDLG GSC+++FSTMPFL
Sbjct: 1527 SGFFITRHSQTTSIPQCPQGTAKMWHGYSLLFVQGNERGHGQDLGKPGSCLKRFSTMPFL 1586
Query: 227 FCDLNDVCNYASRNDRSYWLSTGQPIP--MMPVEGNEIVKYISRCVVCEVPSNVIAVHSQ 400
FC++N+VC+ ASRND SYWLST +P+P M P+ G ++ +ISRCVVCE P+ V+ VHSQ
Sbjct: 1587 FCNINNVCHVASRNDYSYWLSTTEPMPMNMAPIRGGQLQPFISRCVVCEAPAQVLTVHSQ 1646
Query: 401 TLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
T++IP CP W LWIGYSF+MHT L+SPGSCLE
Sbjct: 1647 TVNIPDCPDRWGVLWIGYSFMMHTGPGGEGSGQMLSSPGSCLE 1689
Score = 55.2 bits (127), Expect = 9e-07
Identities = 36/110 (32%), Positives = 51/110 (46%), Gaps = 7/110 (6%)
Frame = +2
Query: 56 LLVRHSQREVVPQCEPGHVKLWDGYSLLYIDG-NEKAHNQDLGYAGSCVRKFSTMPFLFC 232
+L HSQ +P C LW GYS + G + Q L GSC+ F + PF+ C
Sbjct: 1640 VLTVHSQTVNIPDCPDRWGVLWIGYSFMMHTGPGGEGSGQMLSSPGSCLEDFRSSPFIEC 1699
Query: 233 DLNDVCNYASRNDRSYWLS--TGQPIPMMP----VEGNEIVKYISRCVVC 364
+ CNY + ++WLS TG MP ++ + +SRC VC
Sbjct: 1700 HGDGKCNYYA-TTYTFWLSSITGNAQFTMPQSETLKAGSLRTRVSRCAVC 1748
>UniRef50_P55787 Cluster: Collagen alpha-4(IV) chain; n=46;
Eumetazoa|Rep: Collagen alpha-4(IV) chain - Oryctolagus
cuniculus (Rabbit)
Length = 623
Score = 220 bits (538), Expect = 1e-56
Identities = 95/163 (58%), Positives = 115/163 (70%)
Frame = +2
Query: 41 YLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMP 220
YL+G LLV HSQ + P C G +LW GYSLLY++G EKAHNQDLG AGSC+ FST+P
Sbjct: 395 YLSGFLLVLHSQTDQEPACPMGMPRLWTGYSLLYLEGQEKAHNQDLGLAGSCLPIFSTLP 454
Query: 221 FLFCDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQ 400
F +C+++ VC+YA RND+SYWL++ P+PMMP+ EI YISRC VCE P+ +AVHSQ
Sbjct: 455 FAYCNIHQVCHYAQRNDKSYWLASAGPLPMMPLSEEEIRPYISRCAVCEAPAQAVAVHSQ 514
Query: 401 TLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
IP CP W LWIGYSF+MHT L SPGSCLE
Sbjct: 515 DQSIPPCPRAWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLE 557
Score = 54.4 bits (125), Expect = 2e-06
Identities = 36/109 (33%), Positives = 50/109 (45%), Gaps = 10/109 (9%)
Frame = +2
Query: 68 HSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD-LN 241
HSQ + +P C LW GYS L++ ++ Q L GSC+ F PFL C
Sbjct: 512 HSQDQSIPPCPRAWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDFRAAPFLECQGRQ 571
Query: 242 DVCNYASRNDRSYWLST--------GQPIPMMPVEGNEIVKYISRCVVC 364
C++ + N+ S+WL+T P P E + ISRC VC
Sbjct: 572 GTCHFFA-NEYSFWLTTVPPDLQVFSAPSPDTLKESQAQRQKISRCQVC 619
>UniRef50_P53420 Cluster: Collagen alpha-4(IV) chain precursor; n=36;
Euteleostomi|Rep: Collagen alpha-4(IV) chain precursor -
Homo sapiens (Human)
Length = 1690
Score = 217 bits (531), Expect = 1e-55
Identities = 94/163 (57%), Positives = 113/163 (69%)
Frame = +2
Query: 41 YLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMP 220
YL G LLV HSQ + P C G +LW GYSLLY++G EKAHNQDLG AGSC+ FST+P
Sbjct: 1462 YLGGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEGQEKAHNQDLGLAGSCLPVFSTLP 1521
Query: 221 FLFCDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQ 400
F +C+++ VC+YA RNDRSYWL++ P+PMMP+ I Y+SRC VCE P+ +AVHSQ
Sbjct: 1522 FAYCNIHQVCHYAQRNDRSYWLASAAPLPMMPLSEEAIRPYVSRCAVCEAPAQAVAVHSQ 1581
Query: 401 TLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
IP CP W LWIGYSF+MHT L SPGSCLE
Sbjct: 1582 DQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLE 1624
Score = 53.2 bits (122), Expect = 4e-06
Identities = 36/109 (33%), Positives = 49/109 (44%), Gaps = 10/109 (9%)
Frame = +2
Query: 68 HSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD-LN 241
HSQ + +P C LW GYS L++ ++ Q L GSC+ F PFL C
Sbjct: 1579 HSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDFRAAPFLECQGRQ 1638
Query: 242 DVCNYASRNDRSYWLST--------GQPIPMMPVEGNEIVKYISRCVVC 364
C++ + N S+WL+T P P E + ISRC VC
Sbjct: 1639 GTCHFFA-NKYSFWLTTVKADLQFSSAPAPDTLKESQAQRQKISRCQVC 1686
>UniRef50_P17139-2 Cluster: Isoform b of P17139 ; n=2; Caenorhabditis
elegans|Rep: Isoform b of P17139 - Caenorhabditis elegans
Length = 1502
Score = 215 bits (524), Expect = 7e-55
Identities = 95/162 (58%), Positives = 114/162 (70%), Gaps = 2/162 (1%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G +HSQ VPQC PG +LW+GYSLLY+ GN +A QDLG GSC+ KF+TMPF+F
Sbjct: 1278 GFTFAKHSQTTAVPQCPPGASQLWEGYSLLYVQGNGRASGQDLGQPGSCLSKFNTMPFMF 1337
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPI-PMM-PVEGNEIVKYISRCVVCEVPSNVIAVHSQT 403
C++N VC+ +SRND S+WLST +P+ PMM PV G I YISRC VCEVP+ +IAVHSQ
Sbjct: 1338 CNMNSVCHVSSRNDYSFWLSTDEPMTPMMNPVTGTAIRPYISRCAVCEVPTQIIAVHSQD 1397
Query: 404 LDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
+P CP GWS +W GYSFVMHT L SPGSCLE
Sbjct: 1398 TSVPQCPQGWSGMWTGYSFVMHTAAGAEGTGQSLQSPGSCLE 1439
Score = 57.2 bits (132), Expect = 2e-07
Identities = 34/110 (30%), Positives = 51/110 (46%), Gaps = 7/110 (6%)
Frame = +2
Query: 56 LLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFC 232
++ HSQ VPQC G +W GYS +++ + Q L GSC+ +F +PF+ C
Sbjct: 1390 IIAVHSQDTSVPQCPQGWSGMWTGYSFVMHTAAGAEGTGQSLQSPGSCLEEFRAVPFIEC 1449
Query: 233 DLNDVCNYASRNDRSYWLSTGQP-----IPM-MPVEGNEIVKYISRCVVC 364
CNY + N +W S PM ++ + +SRC VC
Sbjct: 1450 HGRGTCNYYATN-HGFWPSIVDQDKQFRKPMSQTLKAGGLKDRVSRCQVC 1498
>UniRef50_Q4SZ69 Cluster: Chromosome undetermined SCAF11805, whole
genome shotgun sequence; n=2; Tetraodon
nigroviridis|Rep: Chromosome undetermined SCAF11805,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 471
Score = 208 bits (508), Expect(2) = 1e-53
Identities = 89/141 (63%), Positives = 109/141 (77%), Gaps = 2/141 (1%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G L+ RHSQ + VP C G ++DGYSLLY+ GNE+AH QDLG AGSC+R+FSTMPF+F
Sbjct: 207 GFLITRHSQAQDVPYCPDGTNLIYDGYSLLYVQGNERAHGQDLGTAGSCLRRFSTMPFMF 266
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIP--MMPVEGNEIVKYISRCVVCEVPSNVIAVHSQT 403
C++N+VCN+ASRND SYWLST +P+P M P+ G I +ISRC VCE P+ VIAVHSQT
Sbjct: 267 CNINNVCNFASRNDYSYWLSTPEPMPMSMAPITGESIKPFISRCAVCEAPAMVIAVHSQT 326
Query: 404 LDIPGCPVGWSELWIGYSFVM 466
+ IP CP W LWIGYSF+M
Sbjct: 327 IQIPTCPANWEALWIGYSFMM 347
Score = 49.6 bits (113), Expect(2) = 7e-08
Identities = 27/71 (38%), Positives = 37/71 (52%), Gaps = 6/71 (8%)
Frame = +2
Query: 170 QDLGYAGSCVRKFSTMPFLFCDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVK--- 340
Q L GSC+ +F + PF+ C CNY N S+WL+T +P M +E +K
Sbjct: 398 QALASPGSCLEEFRSAPFIECHGRGTCNYYG-NSYSFWLATVEPSEMFRKPQSETLKAGN 456
Query: 341 ---YISRCVVC 364
+SRCVVC
Sbjct: 457 LQTRVSRCVVC 467
Score = 29.1 bits (62), Expect(2) = 7e-08
Identities = 12/40 (30%), Positives = 19/40 (47%)
Frame = +2
Query: 53 ILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQ 172
+++ HSQ +P C LW GYS + + + H Q
Sbjct: 318 MVIAVHSQTIQIPTCPANWEALWIGYSFMMVGRDTHTHIQ 357
Score = 24.2 bits (50), Expect(2) = 1e-53
Identities = 11/21 (52%), Positives = 11/21 (52%)
Frame = +2
Query: 467 HTXXXXXXXXXXLASPGSCLE 529
HT LASPGSCLE
Sbjct: 388 HTSAGAEGSGQALASPGSCLE 408
>UniRef50_Q28084 Cluster: Collagen alpha-3(IV) chain; n=13; cellular
organisms|Rep: Collagen alpha-3(IV) chain - Bos taurus
(Bovine)
Length = 471
Score = 210 bits (512), Expect = 2e-53
Identities = 93/164 (56%), Positives = 113/164 (68%), Gaps = 2/164 (1%)
Frame = +2
Query: 44 LTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPF 223
+ G + RHSQ +P C G L+ G+SLL++ GNE+AH QDLG GSC+++F+TMPF
Sbjct: 244 MRGFVFTRHSQTTAIPSCPEGTEPLYSGFSLLFVQGNEQAHGQDLGTLGSCLQRFTTMPF 303
Query: 224 LFCDLNDVCNYASRNDRSYWLSTGQPIP--MMPVEGNEIVKYISRCVVCEVPSNVIAVHS 397
LFC++NDVCN+ASRND SYWLST IP M P+ G + YISRC VCE P+ IAVHS
Sbjct: 304 LFCNINDVCNFASRNDYSYWLSTPAMIPMDMAPITGRALEPYISRCTVCEGPAIAIAVHS 363
Query: 398 QTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
QT DIP CP GW LW G+SF+M T LASPGSCLE
Sbjct: 364 QTTDIPPCPAGWISLWKGFSFIMFTSAGSEGAGQALASPGSCLE 407
Score = 70.9 bits (166), Expect = 2e-11
Identities = 39/112 (34%), Positives = 56/112 (50%), Gaps = 8/112 (7%)
Frame = +2
Query: 53 ILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
I + HSQ +P C G + LW G+S +++ + Q L GSC+ +F PF+
Sbjct: 357 IAIAVHSQTTDIPPCPAGWISLWKGFSFIMFTSAGSEGAGQALASPGSCLEEFRASPFIE 416
Query: 230 CDLNDVCNYASRNDRSYWLST-------GQPIPMMPVEGNEIVKYISRCVVC 364
C CNY S N S+WL++ +PIP V+ E+ ISRC VC
Sbjct: 417 CHGRGTCNYYS-NSYSFWLASLDPKRMFRKPIP-STVKAGELENIISRCQVC 466
>UniRef50_UPI0000DBF028 Cluster: UPI0000DBF028 related cluster; n=9;
Rattus norvegicus|Rep: UPI0000DBF028 UniRef100 entry -
Rattus norvegicus
Length = 1549
Score = 209 bits (511), Expect = 3e-53
Identities = 90/139 (64%), Positives = 110/139 (79%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G LV+HSQ E VP C G +LW GYSLL+++G EKAHNQDLG+AGSC+ +FSTMPF++
Sbjct: 1412 GYTLVKHSQSEHVPPCPIGMSQLWVGYSLLFVEGQEKAHNQDLGFAGSCLPRFSTMPFIY 1471
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQTLD 409
C++N+VC+YA RND+SYWLST PIPMMPV +I +YISRC VCE PS IAVHSQ
Sbjct: 1472 CNINEVCHYARRNDKSYWLSTTAPIPMMPVGETQIPQYISRCSVCEAPSQAIAVHSQD-T 1530
Query: 410 IPGCPVGWSELWIGYSFVM 466
+P CP+GW LWIGYSF+M
Sbjct: 1531 VPQCPLGWHSLWIGYSFLM 1549
>UniRef50_Q26640 Cluster: Alpha2(IV)-like collagen; n=4;
Strongylocentrotus purpuratus|Rep: Alpha2(IV)-like
collagen - Strongylocentrotus purpuratus (Purple sea
urchin)
Length = 1747
Score = 209 bits (510), Expect = 3e-53
Identities = 92/160 (57%), Positives = 114/160 (71%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G + RHSQ VP C G V+LW G+S+L+ GN AH+QDLG AGSC+++FSTMPFLF
Sbjct: 1526 GHFITRHSQSRNVPSCPAGTVELWRGFSVLFSMGNGHAHHQDLGDAGSCLQRFSTMPFLF 1585
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQTLD 409
C+ N+VCNYASRNDRSYWL+T +P+PMMP+ +I YISRC VCE P+ +A+HSQ+ +
Sbjct: 1586 CNFNNVCNYASRNDRSYWLTTNEPLPMMPLMNQQIDPYISRCTVCEAPTQSLAIHSQSQE 1645
Query: 410 IPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
IP CP GW LW GYSF M+T L S GSCLE
Sbjct: 1646 IPQCPGGWRSLWTGYSFTMYT-AASEGGGQGLESVGSCLE 1684
Score = 64.1 bits (149), Expect = 2e-09
Identities = 35/109 (32%), Positives = 52/109 (47%), Gaps = 6/109 (5%)
Frame = +2
Query: 59 LVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCDL 238
L HSQ + +PQC G LW GYS + Q L GSC+ F PF+ C+
Sbjct: 1637 LAIHSQSQEIPQCPGGWRSLWTGYSFTMYTAASEGGGQGLESVGSCLENFRATPFIECNG 1696
Query: 239 NDVCNYASRNDRSYWLSTGQ-----PIP-MMPVEGNEIVKYISRCVVCE 367
C++ S N+ S+WL+ IP ++ ++ +SRC VC+
Sbjct: 1697 RGNCHFFS-NEYSFWLTVIDEEDQFAIPRKRTIKSGQLQSVVSRCRVCQ 1744
>UniRef50_Q01955 Cluster: Collagen alpha-3(IV) chain precursor
(Goodpasture antigen) [Contains: Tumstatin]; n=61;
Eumetazoa|Rep: Collagen alpha-3(IV) chain precursor
(Goodpasture antigen) [Contains: Tumstatin] - Homo
sapiens (Human)
Length = 1670
Score = 207 bits (505), Expect = 1e-52
Identities = 91/162 (56%), Positives = 112/162 (69%), Gaps = 2/162 (1%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G + RHSQ +P C G V L+ G+S L++ GN++AH QDLG GSC+++F+TMPFLF
Sbjct: 1445 GFVFTRHSQTTAIPSCPEGTVPLYSGFSFLFVQGNQRAHGQDLGTLGSCLQRFTTMPFLF 1504
Query: 230 CDLNDVCNYASRNDRSYWLSTG--QPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQT 403
C++NDVCN+ASRND SYWLST P+ M P+ G + YISRC VCE P+ IAVHSQT
Sbjct: 1505 CNVNDVCNFASRNDYSYWLSTPALMPMNMAPITGRALEPYISRCTVCEGPAIAIAVHSQT 1564
Query: 404 LDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
DIP CP GW LW G+SF+M T LASPGSCLE
Sbjct: 1565 TDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLE 1606
Score = 73.3 bits (172), Expect = 3e-12
Identities = 41/112 (36%), Positives = 57/112 (50%), Gaps = 8/112 (7%)
Frame = +2
Query: 53 ILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
I + HSQ +P C G + LW G+S +++ + Q L GSC+ +F PFL
Sbjct: 1556 IAIAVHSQTTDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLE 1615
Query: 230 CDLNDVCNYASRNDRSYWLST-------GQPIPMMPVEGNEIVKYISRCVVC 364
C CNY S N S+WL++ +PIP V+ E+ K ISRC VC
Sbjct: 1616 CHGRGTCNYYS-NSYSFWLASLNPERMFRKPIP-STVKAGELEKIISRCQVC 1665
>UniRef50_Q7PVR6 Cluster: ENSANGP00000016652; n=3;
Endopterygota|Rep: ENSANGP00000016652 - Anopheles
gambiae str. PEST
Length = 461
Score = 202 bits (493), Expect = 4e-51
Identities = 84/162 (51%), Positives = 114/162 (70%), Gaps = 2/162 (1%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G L RHSQ+ +P+C KLWDGYSL+ + + ++ QDLG AGSC+R+FSTMPF+F
Sbjct: 184 GYLFARHSQKVTIPECPINTYKLWDGYSLVNVIASSRSVGQDLGAAGSCLRRFSTMPFMF 243
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIP--MMPVEGNEIVKYISRCVVCEVPSNVIAVHSQT 403
CD+N+VCNYAS ND + WL+T +P+P M P+ +++ +YISRC VCE + V+A+HSQ+
Sbjct: 244 CDINNVCNYASNNDDTIWLATPEPMPMSMAPIPADQVERYISRCSVCESNTRVMALHSQS 303
Query: 404 LDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
+ IP CP GW ELW+GYS+ MHT SPGSC+E
Sbjct: 304 MSIPDCPEGWEELWLGYSYAMHTSDNSGGFGQDFVSPGSCME 345
Score = 55.2 bits (127), Expect = 9e-07
Identities = 33/118 (27%), Positives = 56/118 (47%), Gaps = 8/118 (6%)
Frame = +2
Query: 56 LLVRHSQREVVPQCEPGHVKLWDGYSL-LYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFC 232
++ HSQ +P C G +LW GYS ++ N QD GSC+ +F P + C
Sbjct: 296 VMALHSQSMSIPDCPEGWEELWLGYSYAMHTSDNSGGFGQDFVSPGSCMEEFRPQPVIEC 355
Query: 233 DLNDVCNYASRNDRSYWLST-------GQPIPMMPVEGNEIVKYISRCVVCEVPSNVI 385
+ CN+ S+WL+ +P P ++ ++ K +SRC+VC + ++
Sbjct: 356 HGHGTCNFYD-GISSFWLTIIDDAMQFNRPQP-QTLKAHQTSK-VSRCIVCRRKAGIM 410
>UniRef50_UPI00015B49AB Cluster: PREDICTED: similar to alpha-5 type IV
collagen; n=1; Nasonia vitripennis|Rep: PREDICTED:
similar to alpha-5 type IV collagen - Nasonia vitripennis
Length = 1702
Score = 200 bits (489), Expect = 1e-50
Identities = 87/162 (53%), Positives = 111/162 (68%), Gaps = 2/162 (1%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G RHSQ ++P C VK+WDG+SLL++ GN AH QDLG GSC++KFS MPF
Sbjct: 1374 GFYFARHSQSAMIPVCPRNTVKMWDGFSLLHVMGNSYAHAQDLGTPGSCLKKFSVMPFNV 1433
Query: 230 CDLNDVCNYASRNDRSYWLSTGQ--PIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQT 403
C+LN+VC+YA+RND SYWLS+ + P+ M P+ E+ YISRC VCE P+ +I +HSQ+
Sbjct: 1434 CNLNNVCDYANRNDYSYWLSSNEQMPMSMTPIPSREVGAYISRCSVCEAPTRLIVMHSQS 1493
Query: 404 LDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
+ IP CP GW ELW GYSF+MH L+SPGSCLE
Sbjct: 1494 MAIPECPGGWEELWAGYSFLMHRDAGAAGGGQPLSSPGSCLE 1535
Score = 66.1 bits (154), Expect = 5e-10
Identities = 37/109 (33%), Positives = 52/109 (47%), Gaps = 6/109 (5%)
Frame = +2
Query: 56 LLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFC 232
L+V HSQ +P+C G +LW GYS L++ D Q L GSC+ +F PF+ C
Sbjct: 1486 LIVMHSQSMAIPECPGGWEELWAGYSFLMHRDAGAAGGGQPLSSPGSCLEEFRARPFIEC 1545
Query: 233 DLNDVCNYASRNDRSYWLSTGQPI-----PMMPVEGNEIVKYISRCVVC 364
CN+ S SYW++T + P + +SRC VC
Sbjct: 1546 RGQGTCNFFS-TAVSYWMATIKDYEQFRKPQQQTLKTDHTSRVSRCSVC 1593
>UniRef50_UPI00015A592A Cluster: Type IV collagen alpha 4 chain; n=3;
Danio rerio|Rep: Type IV collagen alpha 4 chain - Danio
rerio
Length = 1639
Score = 200 bits (488), Expect = 2e-50
Identities = 90/161 (55%), Positives = 111/161 (68%)
Frame = +2
Query: 47 TGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFL 226
TG LLV HSQ VP C G +LW+GYSLLY++G E+AH QDLG AGSC+ FSTMPF
Sbjct: 1416 TGFLLVMHSQSRYVPTCPAGLTQLWNGYSLLYLEGQERAHTQDLGQAGSCLPVFSTMPFS 1475
Query: 227 FCDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQTL 406
C++ D C+YASRND+SYWLST PIP P++G +I ++ISRCVVCE P+ IA+HSQ
Sbjct: 1476 CCNM-DTCDYASRNDKSYWLSTNAPIPNKPLKGQDIEEHISRCVVCEAPTPTIAIHSQDR 1534
Query: 407 DIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
P CP W LW G+SF+M+T L S GSCL+
Sbjct: 1535 LDPVCPPKWRSLWTGFSFMMYTGSGDEGGGQSLTSTGSCLQ 1575
Score = 49.2 bits (112), Expect = 6e-05
Identities = 32/107 (29%), Positives = 50/107 (46%), Gaps = 8/107 (7%)
Frame = +2
Query: 68 HSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD-LN 241
HSQ + P C P LW G+S ++Y ++ Q L GSC++ F + PF+ C
Sbjct: 1530 HSQDRLDPVCPPKWRSLWTGFSFMMYTGSGDEGGGQSLTSTGSCLQDFRSQPFVECQGPR 1589
Query: 242 DVCNYASRNDRSYWLST----GQPIPMMPV--EGNEIVKYISRCVVC 364
C+Y + + S+W++ P PV E + SRC +C
Sbjct: 1590 GTCSYFA-SIYSFWMTIDMEHNDSSPHGPVITEERQQRDSTSRCSIC 1635
>UniRef50_Q4TZW9 Cluster: Type IV collagen alpha 4 chain; n=3; Danio
rerio|Rep: Type IV collagen alpha 4 chain - Danio rerio
(Zebrafish) (Brachydanio rerio)
Length = 240
Score = 200 bits (488), Expect = 2e-50
Identities = 90/161 (55%), Positives = 111/161 (68%)
Frame = +2
Query: 47 TGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFL 226
TG LLV HSQ VP C G +LW+GYSLLY++G E+AH QDLG AGSC+ FSTMPF
Sbjct: 14 TGFLLVMHSQSRYVPTCPAGLTQLWNGYSLLYLEGQERAHTQDLGQAGSCLPVFSTMPFS 73
Query: 227 FCDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQTL 406
C++ D C+YASRND+SYWLST PIP P++G +I ++ISRCVVCE P+ IA+HSQ
Sbjct: 74 CCNM-DTCDYASRNDKSYWLSTNAPIPNKPLKGQDIEEHISRCVVCEAPTPTIAIHSQDR 132
Query: 407 DIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
P CP W LW G+SF+M+T L S GSCL+
Sbjct: 133 LDPVCPPKWRNLWTGFSFMMYTGSGDEGGGQSLTSTGSCLQ 173
Score = 49.2 bits (112), Expect = 6e-05
Identities = 32/107 (29%), Positives = 50/107 (46%), Gaps = 8/107 (7%)
Frame = +2
Query: 68 HSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD-LN 241
HSQ + P C P LW G+S ++Y ++ Q L GSC++ F + PF+ C
Sbjct: 128 HSQDRLDPVCPPKWRNLWTGFSFMMYTGSGDEGGGQSLTSTGSCLQDFRSQPFVECQGPR 187
Query: 242 DVCNYASRNDRSYWLST----GQPIPMMPV--EGNEIVKYISRCVVC 364
C+Y + + S+W++ P PV E + SRC +C
Sbjct: 188 GTCSYFA-SIYSFWMTIDMEHNDSSPHGPVITEERQQRDSTSRCSIC 233
>UniRef50_Q58FS7 Cluster: Type IV collagen alpha 3 chain; n=2; Danio
rerio|Rep: Type IV collagen alpha 3 chain - Danio rerio
(Zebrafish) (Brachydanio rerio)
Length = 244
Score = 199 bits (486), Expect = 3e-50
Identities = 91/162 (56%), Positives = 111/162 (68%), Gaps = 2/162 (1%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G L RHSQ V+P+C G +L+ GYSLL+I+GN + H QDLG GSC+ F+TMPF+
Sbjct: 17 GFLFTRHSQTTVIPECPAGSKRLYTGYSLLFINGNNRGHGQDLGTLGSCLPMFNTMPFMV 76
Query: 230 CDLNDVCNYASRNDRSYWLSTGQP-IPMMPVEGNEIVK-YISRCVVCEVPSNVIAVHSQT 403
C+ ++ C YASRND SYWLST P +P + EI+K YISRC VCE +NVIA+HSQT
Sbjct: 77 CNRDETCRYASRNDYSYWLSTDTPMLPDQQMMSGEILKWYISRCSVCEAIANVIAIHSQT 136
Query: 404 LDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
++IP CPVGW LW GYSFVM T L SPGSCLE
Sbjct: 137 INIPQCPVGWLSLWEGYSFVMQTGVGAEGSGQPLVSPGSCLE 178
Score = 65.3 bits (152), Expect = 8e-10
Identities = 36/103 (34%), Positives = 51/103 (49%), Gaps = 4/103 (3%)
Frame = +2
Query: 68 HSQREVVPQCEPGHVKLWDGYSLLYIDG-NEKAHNQDLGYAGSCVRKFSTMPFLFCDLND 244
HSQ +PQC G + LW+GYS + G + Q L GSC+ +F +PF+ C
Sbjct: 133 HSQTINIPQCPVGWLSLWEGYSFVMQTGVGAEGSGQPLVSPGSCLEQFRKIPFIECHGRG 192
Query: 245 VCNYASRNDRSYWLSTGQPIPMMPVEGNEIVK---YISRCVVC 364
CN+ + SYWL++ M + + K ISRC VC
Sbjct: 193 TCNFYP-DSYSYWLASLDHTNMFSMPNRQTAKQKEIISRCQVC 234
>UniRef50_UPI00006608B5 Cluster: Homolog of Homo sapiens "Tumstatin;
n=1; Takifugu rubripes|Rep: Homolog of Homo sapiens
"Tumstatin - Takifugu rubripes
Length = 1374
Score = 198 bits (482), Expect = 9e-50
Identities = 89/160 (55%), Positives = 111/160 (69%), Gaps = 2/160 (1%)
Frame = +2
Query: 56 LLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD 235
L RHSQ +P+C G +++ GYSLL+I+GN +AH QDLG GSC+ +F+TMPFLFC+
Sbjct: 1153 LFTRHSQELSIPECPVGSTEVYSGYSLLFINGNNRAHGQDLGTLGSCLPRFTTMPFLFCN 1212
Query: 236 LNDVCNYASRNDRSYWLSTGQPI-PMMP-VEGNEIVKYISRCVVCEVPSNVIAVHSQTLD 409
+ C YASRND SYWLST Q + MP + G+ + YISRC VCE +NVIA+HSQT
Sbjct: 1213 TDSTCRYASRNDYSYWLSTNQVVLSNMPLISGDLLRSYISRCSVCETRTNVIAIHSQTSV 1272
Query: 410 IPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
+P CP+GW LW+GYSFVM T LASPGSCLE
Sbjct: 1273 VPDCPLGWLPLWVGYSFVMETGVGAEGSGQPLASPGSCLE 1312
Score = 65.3 bits (152), Expect = 8e-10
Identities = 40/113 (35%), Positives = 56/113 (49%), Gaps = 7/113 (6%)
Frame = +2
Query: 47 TGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDG-NEKAHNQDLGYAGSCVRKFSTMPF 223
T ++ + HSQ VVP C G + LW GYS + G + Q L GSC+ +F +PF
Sbjct: 1261 TNVIAI-HSQTSVVPDCPLGWLPLWVGYSFVMETGVGAEGSGQPLASPGSCLEQFRKIPF 1319
Query: 224 LFCDLNDVCNYASRNDRSYWLSTGQPIPMM--PVEGNEIVKY----ISRCVVC 364
+ C CNY + + SYWL+ P M P + ++ ISRC VC
Sbjct: 1320 IECHGRGTCNYYT-DSYSYWLAALSPHDMFSKPKPHTDTGEFPGSLISRCRVC 1371
>UniRef50_UPI000065E567 Cluster: Homolog of Brachydanio rerio
"Collagen, type I, alpha 3.; n=1; Takifugu rubripes|Rep:
Homolog of Brachydanio rerio "Collagen, type I, alpha 3.
- Takifugu rubripes
Length = 1426
Score = 195 bits (475), Expect = 6e-49
Identities = 90/160 (56%), Positives = 104/160 (65%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G LLV HSQ VP+C G LW GYSL Y+ G + AH QDLG AGSC+R FSTMPF +
Sbjct: 1205 GFLLVIHSQSVQVPKCPDGSSLLWVGYSLAYLKGQKNAHAQDLGQAGSCLRVFSTMPFSY 1264
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQTLD 409
C+ C+++SRND+SYWLST PIPMMPV G EI +ISRCVVCE S + HSQ
Sbjct: 1265 CN-KAACHFSSRNDKSYWLSTAAPIPMMPVFGQEISSHISRCVVCETVSPAVVFHSQEHT 1323
Query: 410 IPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
P CP GW LW GYSF+MHT L S GSCL+
Sbjct: 1324 APACPQGWRSLWTGYSFLMHTGAGDEGSGQALTSSGSCLK 1363
Score = 42.3 bits (95), Expect = 0.007
Identities = 19/52 (36%), Positives = 28/52 (53%)
Frame = +2
Query: 371 PSNVIAVHSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCL 526
P ++ +HSQ++ +P CP G S LW+GYS + + L GSCL
Sbjct: 1204 PGFLLVIHSQSVQVPKCPDGSSLLWVGYS-LAYLKGQKNAHAQDLGQAGSCL 1254
>UniRef50_Q9GQB1 Cluster: Type IV collagen alpha 1 chain precursor;
n=1; Hydra vulgaris|Rep: Type IV collagen alpha 1 chain
precursor - Hydra attenuata (Hydra) (Hydra vulgaris)
Length = 1723
Score = 189 bits (461), Expect = 3e-47
Identities = 84/160 (52%), Positives = 105/160 (65%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G LV+HSQ VP C G +W+GYS LY GNE+A QDLG GSC+++FSTMPFLF
Sbjct: 1501 GFYLVKHSQSIKVPSCPAGMQTMWEGYSFLYAQGNERAFGQDLGQPGSCLKRFSTMPFLF 1560
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQTLD 409
CD+ + C ASRND S+WLST + P G ++ YISRC+VCE PS+V+AVHSQ+
Sbjct: 1561 CDIQNKCVVASRNDYSFWLSTAEKPKEAPSSGADLENYISRCIVCEAPSHVLAVHSQSEL 1620
Query: 410 IPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
P CP GW LW G+SF+M+ L+S GSCLE
Sbjct: 1621 DPKCPDGWENLWTGFSFLMYNSAGAQGSGQLLSSSGSCLE 1660
Score = 61.7 bits (143), Expect = 1e-08
Identities = 36/109 (33%), Positives = 52/109 (47%), Gaps = 6/109 (5%)
Frame = +2
Query: 56 LLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFC 232
+L HSQ E+ P+C G LW G+S L+Y + Q L +GSC+ F P++ C
Sbjct: 1611 VLAVHSQSELDPKCPDGWENLWTGFSFLMYNSAGAQGSGQLLSSSGSCLEDFRVNPYIEC 1670
Query: 233 DLNDVCNYASRNDRSYWLST-GQ----PIPMMPVEGNEIVKYISRCVVC 364
C Y S+WLST G+ +P + + +SRC VC
Sbjct: 1671 HGRGTCWYYGPT-LSFWLSTIGESNMFQVPKFEILERNLKARVSRCAVC 1718
>UniRef50_UPI000065E566 Cluster: Homolog of Homo sapiens "Splice
Isoform 1 of Collagen alpha 3; n=1; Takifugu
rubripes|Rep: Homolog of Homo sapiens "Splice Isoform 1
of Collagen alpha 3 - Takifugu rubripes
Length = 1258
Score = 188 bits (458), Expect = 7e-47
Identities = 87/160 (54%), Positives = 102/160 (63%), Gaps = 2/160 (1%)
Frame = +2
Query: 56 LLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD 235
++ RHSQ +P C G L+ GYS L++ N++ H QDLG GSC+ FSTMPFL CD
Sbjct: 1039 MIARHSQSIHIPVCPCGTSLLFSGYSFLFMHANDRVHGQDLGTPGSCLPHFSTMPFLVCD 1098
Query: 236 LNDVCNYASRNDRSYWLSTGQPIP--MMPVEGNEIVKYISRCVVCEVPSNVIAVHSQTLD 409
C YASRND SYWLSTG+ +P M+ + G+ + YISRC VCE SNVIAVHSQT
Sbjct: 1099 TESNCRYASRNDYSYWLSTGKALPENMVSITGDMLASYISRCAVCETTSNVIAVHSQTTQ 1158
Query: 410 IPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
IP CP W LW GYSFVM T L SPGSCLE
Sbjct: 1159 IPDCPQDWVSLWSGYSFVMQTGIGADGSSQPLISPGSCLE 1198
Score = 61.7 bits (143), Expect = 1e-08
Identities = 37/114 (32%), Positives = 54/114 (47%), Gaps = 8/114 (7%)
Frame = +2
Query: 47 TGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDG-NEKAHNQDLGYAGSCVRKFSTMPF 223
T ++ HSQ +P C V LW GYS + G +Q L GSC+ F +PF
Sbjct: 1146 TSNVIAVHSQTTQIPDCPQDWVSLWSGYSFVMQTGIGADGSSQPLISPGSCLETFRKVPF 1205
Query: 224 LFCDLNDVCNYASRNDRSYWLST-------GQPIPMMPVEGNEIVKYISRCVVC 364
+ C CNY + S+W+++ G+PIP V+ + +SRC VC
Sbjct: 1206 IECHGRGTCNYYP-DSYSFWMASLDPKNMFGKPIP-QTVKEPSLQSILSRCRVC 1257
>UniRef50_Q9VMV5 Cluster: CG16858-PA; n=6; Schizophora|Rep: CG16858-PA
- Drosophila melanogaster (Fruit fly)
Length = 1940
Score = 186 bits (452), Expect = 4e-46
Identities = 80/162 (49%), Positives = 107/162 (66%), Gaps = 2/162 (1%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G + RHSQ VPQC LW+GYSL +A QDLG +GSC+ +F+TMP++
Sbjct: 1515 GFIFARHSQSVHVPQCPANTNLLWEGYSLSGNVAASRAVGQDLGQSGSCMMRFTTMPYML 1574
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIPMM--PVEGNEIVKYISRCVVCEVPSNVIAVHSQT 403
CD+ +VC++A ND S WLST +P+PM P++G +++KYISRCVVCE + +IA+HSQ+
Sbjct: 1575 CDITNVCHFAQNNDDSLWLSTAEPMPMTMTPIQGRDLMKYISRCVVCETTTRIIALHSQS 1634
Query: 404 LDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
+ IP CP GW E+W GYS+ M T L SPGSCLE
Sbjct: 1635 MSIPDCPGGWEEMWTGYSYFMSTLDNVGGVGQNLVSPGSCLE 1676
Score = 50.8 bits (116), Expect = 2e-05
Identities = 33/114 (28%), Positives = 47/114 (41%), Gaps = 6/114 (5%)
Frame = +2
Query: 56 LLVRHSQREVVPQCEPGHVKLWDGYSLLYID-GNEKAHNQDLGYAGSCVRKFSTMPFLFC 232
++ HSQ +P C G ++W GYS N Q+L GSC+ +F P + C
Sbjct: 1627 IIALHSQSMSIPDCPGGWEEMWTGYSYFMSTLDNVGGVGQNLVSPGSCLEEFRAQPVIEC 1686
Query: 233 DLNDVCNYASRNDRSYWLSTGQP-----IPMMPVEGNEIVKYISRCVVCEVPSN 379
+ CNY S+WL+ + P + ISRC VC N
Sbjct: 1687 HGHGRCNYYDAL-ASFWLTVIEEQDQFVQPRQQTLKADFTSKISRCTVCRRRGN 1739
>UniRef50_UPI0000DA44CD Cluster: PREDICTED: similar to procollagen,
type IV, alpha 6; n=1; Rattus norvegicus|Rep: PREDICTED:
similar to procollagen, type IV, alpha 6 - Rattus
norvegicus
Length = 1405
Score = 179 bits (436), Expect = 3e-44
Identities = 79/130 (60%), Positives = 98/130 (75%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G LV+HSQ E VP C G +LW GYSLL+++G EKAHNQDLG+AGSC+ +FSTMPF++
Sbjct: 1248 GYTLVKHSQSEHVPPCPIGMSQLWVGYSLLFVEGQEKAHNQDLGFAGSCLPRFSTMPFIY 1307
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQTLD 409
C++N+VC+YA RND+SYWLST PIPMMPV +I +YISRC VCE PS IAVHSQ
Sbjct: 1308 CNINEVCHYARRNDKSYWLSTTAPIPMMPVGETQIPQYISRCSVCEAPSQAIAVHSQDNH 1367
Query: 410 IPGCPVGWSE 439
P G ++
Sbjct: 1368 RSTVPFGLAQ 1377
Score = 41.1 bits (92), Expect = 0.016
Identities = 18/45 (40%), Positives = 25/45 (55%)
Frame = +2
Query: 392 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCL 526
HSQ+ +P CP+G S+LW+GYS ++ L GSCL
Sbjct: 1254 HSQSEHVPPCPIGMSQLWVGYS-LLFVEGQEKAHNQDLGFAGSCL 1297
>UniRef50_O09238 Cluster: Collagen type IV; n=2; Pseudocorticium
jarrei|Rep: Collagen type IV - Pseudocorticium jarrei
Length = 854
Score = 159 bits (386), Expect = 4e-38
Identities = 76/160 (47%), Positives = 95/160 (59%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G+LLV HSQ +PQC + +LW GYSLL + GN QDLG GSC+ F MP +
Sbjct: 629 GLLLVVHSQTTNIPQCPNDYTRLWVGYSLLQLTGNGLGVGQDLGDPGSCMPSFHPMPVVR 688
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQTLD 409
C+ C +A R D SYWLST P +PV G++I ++ISRC VCE S IAVHSQ +
Sbjct: 689 CNPMQRCEFARRKDESYWLSTNATRPPIPVSGSDIEEHISRCSVCESNSISIAVHSQDSN 748
Query: 410 IPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
+P C GW LW G+SF+ T L SPGSCL+
Sbjct: 749 VPDCFPGWVTLWTGFSFLQQTAAQAEGTGQGLESPGSCLQ 788
Score = 62.5 bits (145), Expect = 6e-09
Identities = 40/112 (35%), Positives = 55/112 (49%), Gaps = 7/112 (6%)
Frame = +2
Query: 53 ILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNE-KAHNQDLGYAGSCVRKFSTMPFLF 229
I + HSQ VP C PG V LW G+S L + + Q L GSC++ F + PF+
Sbjct: 738 ISIAVHSQDSNVPDCFPGWVTLWTGFSFLQQTAAQAEGTGQGLESPGSCLQHFRSTPFIG 797
Query: 230 CDLNDVCNYASRNDRSYWLSTGQPI-PMMPVEG-----NEIVKYISRCVVCE 367
C C+Y S + SYW+ + P E ++I K +SRC VCE
Sbjct: 798 CGGRGQCSYDSVSG-SYWMIVLDALNPFQDTEPGTYPVSDIEKRLSRCRVCE 848
>UniRef50_Q5C3P1 Cluster: SJCHGC06113 protein; n=2;
Platyhelminthes|Rep: SJCHGC06113 protein - Schistosoma
japonicum (Blood fluke)
Length = 587
Score = 131 bits (316), Expect = 1e-29
Identities = 66/163 (40%), Positives = 90/163 (55%), Gaps = 2/163 (1%)
Frame = +2
Query: 47 TGILLVRHSQREVVPQ--CEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMP 220
+ IL RH Q V C G KL+ GYS + G + + DLG SC+ KFS++P
Sbjct: 358 SSILFARHYQTPFVENLTCPGGTNKLFTGYSYVMGGGVDDLVSMDLGTPSSCLSKFSSLP 417
Query: 221 FLFCDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQ 400
C+ + C + R++RSYWL+T P P+ N+ I+RCVVCE PS+V A HSQ
Sbjct: 418 MTQCERDTTCQSSMRHERSYWLATLVPRSEQPIPVNQTADQIARCVVCEAPSHVFAFHSQ 477
Query: 401 TLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
+ CP W+ELW G S ++HT L+SPGSC+E
Sbjct: 478 GETLQPCPSTWTELWTGVSLILHT-SGAHGGGQQLSSPGSCME 519
Score = 44.4 bits (100), Expect = 0.002
Identities = 33/115 (28%), Positives = 51/115 (44%), Gaps = 12/115 (10%)
Frame = +2
Query: 68 HSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCDLN-D 244
HSQ E + C +LW G SL+ Q L GSC+ F P + C+ N
Sbjct: 475 HSQGETLQPCPSTWTELWTGVSLILHTSGAHGGGQQLSSPGSCMEHFRYSPVIECNNNVG 534
Query: 245 VCNYASRNDRSYWLSTGQP----------IPMMPVEGNEIVKYISRCVVC-EVPS 376
+C+Y S + + Y+L P M EG ++ +S+C VC ++P+
Sbjct: 535 MCHYWS-DAKVYYLRALNPNITQFEKPVGFVMKAAEG-PVLNNVSKCRVCMKIPA 587
>UniRef50_Q4SB07 Cluster: Chromosome undetermined SCAF14677, whole
genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome undetermined SCAF14677, whole genome shotgun
sequence - Tetraodon nigroviridis (Green puffer)
Length = 856
Score = 118 bits (284), Expect = 8e-26
Identities = 53/101 (52%), Positives = 71/101 (70%), Gaps = 2/101 (1%)
Frame = +2
Query: 56 LLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD 235
L RHSQ +P+C G +++ GYSLL+I+GN +AH QDLG GSC+ +F+TMPFLFC+
Sbjct: 737 LFTRHSQELYIPECPAGSTQVYSGYSLLFINGNNRAHGQDLGTLGSCLPRFTTMPFLFCN 796
Query: 236 LNDVCNYASRNDRSYWLSTGQPI-PMMP-VEGNEIVKYISR 352
+ C YASRND SYWLST + + MP + G+ + YISR
Sbjct: 797 TDRTCRYASRNDYSYWLSTNKMVLSNMPLISGDLLRSYISR 837
Score = 32.7 bits (71), Expect = 5.4
Identities = 17/45 (37%), Positives = 24/45 (53%)
Frame = +2
Query: 392 HSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCL 526
HSQ L IP CP G ++++ GYS ++ L + GSCL
Sbjct: 741 HSQELYIPECPAGSTQVYSGYS-LLFINGNNRAHGQDLGTLGSCL 784
>UniRef50_A7T3G2 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 590
Score = 109 bits (261), Expect = 5e-23
Identities = 43/76 (56%), Positives = 59/76 (77%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G +V+HSQ P+C P + KLWDGYSLLY+ G++ +H QDLG AGSC+++F+TMP+L+
Sbjct: 475 GFYIVKHSQTTTPPECPPTYDKLWDGYSLLYVQGHDVSHGQDLGQAGSCLKRFTTMPYLY 534
Query: 230 CDLNDVCNYASRNDRS 277
C++ CNYASRND S
Sbjct: 535 CNIFGKCNYASRNDYS 550
Score = 35.9 bits (79), Expect = 0.58
Identities = 18/49 (36%), Positives = 24/49 (48%)
Frame = +2
Query: 383 IAVHSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
I HSQT P CP + +LW GYS +++ L GSCL+
Sbjct: 478 IVKHSQTTTPPECPPTYDKLWDGYS-LLYVQGHDVSHGQDLGQAGSCLK 525
>UniRef50_A7T795 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 331
Score = 108 bits (259), Expect = 9e-23
Identities = 42/74 (56%), Positives = 58/74 (78%)
Frame = +2
Query: 50 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 229
G +V+HSQ P+C P + KLWDGYSLLY+ G++ +H QDLG AGSC+++F+TMP+L+
Sbjct: 256 GFYIVKHSQTTTPPECPPTYDKLWDGYSLLYVQGHDVSHGQDLGQAGSCLKRFTTMPYLY 315
Query: 230 CDLNDVCNYASRND 271
C++ CNYASRND
Sbjct: 316 CNIFGKCNYASRND 329
Score = 35.9 bits (79), Expect = 0.58
Identities = 18/49 (36%), Positives = 24/49 (48%)
Frame = +2
Query: 383 IAVHSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCLE 529
I HSQT P CP + +LW GYS +++ L GSCL+
Sbjct: 259 IVKHSQTTTPPECPPTYDKLWDGYS-LLYVQGHDVSHGQDLGQAGSCLK 306
>UniRef50_Q5BYE6 Cluster: SJCHGC08138 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC08138 protein - Schistosoma
japonicum (Blood fluke)
Length = 206
Score = 93.5 bits (222), Expect = 3e-18
Identities = 46/109 (42%), Positives = 60/109 (55%), Gaps = 2/109 (1%)
Frame = +2
Query: 47 TGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPF- 223
+G L HSQ P C ++ GYSL+ + G++ + DLG GSC+RKFS MPF
Sbjct: 92 SGFLFTVHSQDSQPPSCPIYTTPVYTGYSLVTLQGDDDSTTMDLGTPGSCLRKFSIMPFA 151
Query: 224 -LFCDLNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCE 367
F +N C RN RSYWLST + + P I YISRC+VC+
Sbjct: 152 NCFAKVNGNCQINMRNGRSYWLSTLEQYMLSPARVENIKPYISRCIVCQ 200
Score = 33.1 bits (72), Expect = 4.1
Identities = 17/49 (34%), Positives = 23/49 (46%)
Frame = +2
Query: 380 VIAVHSQTLDIPGCPVGWSELWIGYSFVMHTXXXXXXXXXXLASPGSCL 526
+ VHSQ P CP+ + ++ GYS V L +PGSCL
Sbjct: 95 LFTVHSQDSQPPSCPIYTTPVYTGYSLVT-LQGDDDSTTMDLGTPGSCL 142
>UniRef50_Q4WVM5 Cluster: DNA polymerase gamma; n=13;
Pezizomycotina|Rep: DNA polymerase gamma - Aspergillus
fumigatus (Sartorya fumigata)
Length = 1135
Score = 33.1 bits (72), Expect = 4.1
Identities = 17/54 (31%), Positives = 24/54 (44%)
Frame = +2
Query: 119 WDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCDLNDVCNYASRNDRSY 280
WDGY L + D V+KF P + CD+++ N RNDR +
Sbjct: 533 WDGYPLTWSD----KFGWTFKVPKDQVKKFENQPVVLCDMSEEKNLELRNDRKH 582
>UniRef50_A4RDD1 Cluster: Putative uncharacterized protein; n=1;
Magnaporthe grisea|Rep: Putative uncharacterized protein
- Magnaporthe grisea (Rice blast fungus) (Pyricularia
grisea)
Length = 767
Score = 32.7 bits (71), Expect = 5.4
Identities = 16/39 (41%), Positives = 22/39 (56%)
Frame = +2
Query: 236 LNDVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISR 352
+ ++A RN +LS G PIP MPV+ N V +SR
Sbjct: 49 VTSTASFADRNLVVSYLSDGDPIPSMPVDRNLTVSPLSR 87
>UniRef50_Q4T5R1 Cluster: Chromosome undetermined SCAF9151, whole
genome shotgun sequence; n=1; Tetraodon
nigroviridis|Rep: Chromosome undetermined SCAF9151,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 704
Score = 31.9 bits (69), Expect = 9.5
Identities = 14/36 (38%), Positives = 22/36 (61%)
Frame = +2
Query: 86 VPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGS 193
VP+ PGHV+L G +L G+ + H ++LG A +
Sbjct: 113 VPERLPGHVRLVHGQQVLPGQGDVRLHREELGIAAA 148
>UniRef50_A1G1L8 Cluster: Glycoside hydrolase family 2,
immunoglobulin-like beta-sandwich; n=1; Stenotrophomonas
maltophilia R551-3|Rep: Glycoside hydrolase family 2,
immunoglobulin-like beta-sandwich - Stenotrophomonas
maltophilia R551-3
Length = 895
Score = 31.9 bits (69), Expect = 9.5
Identities = 23/70 (32%), Positives = 37/70 (52%)
Frame = +2
Query: 113 KLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCDLNDVCNYASRNDRSYWLST 292
++W GY L+ GN+ Q +G G V +S+ P DL++ N ++R D+ YW
Sbjct: 518 RVWQGYVDLF--GNDL--RQVVGEEGLGVPYWSSSPSN--DLDEKANDSTRGDKHYWQVW 571
Query: 293 GQPIPMMPVE 322
G P +PV+
Sbjct: 572 GN--PALPVQ 579
>UniRef50_A0CNR1 Cluster: Chromosome undetermined scaffold_22, whole
genome shotgun sequence; n=1; Paramecium tetraurelia|Rep:
Chromosome undetermined scaffold_22, whole genome shotgun
sequence - Paramecium tetraurelia
Length = 1157
Score = 31.9 bits (69), Expect = 9.5
Identities = 16/55 (29%), Positives = 25/55 (45%)
Frame = +2
Query: 242 DVCNYASRNDRSYWLSTGQPIPMMPVEGNEIVKYISRCVVCEVPSNVIAVHSQTL 406
D CN A R Y+L+ GQP P +E I++ + + + S+TL
Sbjct: 994 DACNEALNYQRQYYLNQGQPYPWEQLEYQAIIQKDKLSTILRNMEDKVLTWSKTL 1048
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 538,477,166
Number of Sequences: 1657284
Number of extensions: 10842631
Number of successful extensions: 23817
Number of sequences better than 10.0: 38
Number of HSP's better than 10.0 without gapping: 23046
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 23741
length of database: 575,637,011
effective HSP length: 95
effective length of database: 418,195,031
effective search space used: 33873797511
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)