BLASTX 2.2.12 [Aug-07-2005]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= I10A02NGRL0002_I23
(342 letters)
Database: uniref50
1,657,284 sequences; 575,637,011 total letters
Searching..................................................done
                                                                    Score   E
Sequences producing significant alignments:                        (bits)   Value
UniRef50_UPI0000DB7985 Cluster: PREDICTED: similar to Collagen t...   146   6e-35
UniRef50_Q4S0I4 Cluster: Chromosome 2 SCAF14781, whole genome sh...   143   8e-34
UniRef50_P08120 Cluster: Collagen alpha-1(IV) chain precursor; n...   142   2e-33
UniRef50_UPI0000DA44CD Cluster: PREDICTED: similar to procollage...   140   7e-33
UniRef50_UPI0000DBF028 Cluster: UPI0000DBF028 related cluster; n...   140   7e-33
UniRef50_UPI0000613E3C Cluster: Collagen alpha-2(IV) chain; n=2;...   139   1e-32
UniRef50_P08572 Cluster: Collagen alpha-2(IV) chain precursor [C...   139   1e-32
UniRef50_Q4SZ69 Cluster: Chromosome undetermined SCAF11805, whol...   138   2e-32
UniRef50_Q4SZ73 Cluster: Chromosome undetermined SCAF11805, whol...   137   5e-32
UniRef50_P29400 Cluster: Collagen alpha-5(IV) chain precursor; n...   136   9e-32
UniRef50_Q07265 Cluster: 3 alpha procollagen; n=4; Strongylocent...   134   5e-31
UniRef50_P53420 Cluster: Collagen alpha-4(IV) chain precursor; n...   134   5e-31
UniRef50_Q26640 Cluster: Alpha2(IV)-like collagen; n=4; Strongyl...   133   6e-31
UniRef50_P55787 Cluster: Collagen alpha-4(IV) chain; n=46; Eumet...   133   6e-31
UniRef50_UPI00015A592A Cluster: Type IV collagen alpha 4 chain; ...   127   4e-29
UniRef50_Q4TZW9 Cluster: Type IV collagen alpha 4 chain; n=3; Da...   127   4e-29
UniRef50_Q28084 Cluster: Collagen alpha-3(IV) chain; n=13; cellu...   126   7e-29
UniRef50_P17139-2 Cluster: Isoform b of P17139 ; n=2; Caenorhabd...   125   2e-28
UniRef50_Q01955 Cluster: Collagen alpha-3(IV) chain precursor (G...   124   4e-28
UniRef50_Q7PVR6 Cluster: ENSANGP00000016652; n=3; Endopterygota|...   121   3e-27
UniRef50_UPI00015B49AB Cluster: PREDICTED: similar to alpha-5 ty...   118   2e-26
UniRef50_UPI000065E567 Cluster: Homolog of Brachydanio rerio "Co...   117   6e-26
UniRef50_UPI00006608B5 Cluster: Homolog of Homo sapiens "Tumstat...   116   8e-26
UniRef50_Q58FS7 Cluster: Type IV collagen alpha 3 chain; n=2; Da...   114   3e-25
UniRef50_Q4SB07 Cluster: Chromosome undetermined SCAF14677, whol...   114   4e-25
UniRef50_Q9GQB1 Cluster: Type IV collagen alpha 1 chain precurso...   113   7e-25
UniRef50_A7T3G2 Cluster: Predicted protein; n=1; Nematostella ve...   109   2e-23
UniRef50_A7T795 Cluster: Predicted protein; n=1; Nematostella ve...   108   3e-23
UniRef50_UPI000065E566 Cluster: Homolog of Homo sapiens "Splice ...   104   3e-22
UniRef50_Q9VMV5 Cluster: CG16858-PA; n=6; Schizophora|Rep: CG168...   100   1e-20
UniRef50_O09238 Cluster: Collagen type IV; n=2; Pseudocorticium ...    88   3e-17
UniRef50_Q5BYE6 Cluster: SJCHGC08138 protein; n=1; Schistosoma j...    75   4e-13
UniRef50_Q5C3P1 Cluster: SJCHGC06113 protein; n=2; Platyhelminth...    66   2e-10
UniRef50_Q4WVM5 Cluster: DNA polymerase gamma; n=13; Pezizomycot...    33   1.2
UniRef50_Q4T5R1 Cluster: Chromosome undetermined SCAF9151, whole...    32   2.9
UniRef50_A1G1L8 Cluster: Glycoside hydrolase family 2, immunoglo...    31   3.8
UniRef50_A3UAL4 Cluster: Dihydrolipoamide dehydrogenase; n=1; Cr...    31   6.7
UniRef50_A0GRE7 Cluster: Putative uncharacterized protein; n=1; ...    31   6.7
UniRef50_A2G6I6 Cluster: Putative uncharacterized protein; n=1; ...    31   6.7
UniRef50_Q50244 Cluster: Surface layer protein B; n=4; Methanosa...    31   6.7
UniRef50_Q2T420 Cluster: ImcF-related family; n=14; Burkholderia...    30   8.8
UniRef50_Q9CAI9 Cluster: Putative uncharacterized protein F28P22...    30   8.8
UniRef50_P20061 Cluster: Transcobalamin-1 precursor; n=7; Euther...    30   8.8
>UniRef50_UPI0000DB7985 Cluster: PREDICTED: similar to Collagen type
IV CG4145-PA, isoform A isoform 1; n=1; Apis
mellifera|Rep: PREDICTED: similar to Collagen type IV
CG4145-PA, isoform A isoform 1 - Apis mellifera
Length = 1913
Score = 146 bits (355), Expect = 6e-35
Identities = 69/117 (58%), Positives = 85/117 (72%), Gaps = 24/117 (20%)
Frame = +3
Query: 63 TDYLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLG----------- 209
+DYLTGILLV+HSQ +++P C+ GH+KLW+GYSLL+ DG+E+AH+QDLG
Sbjct: 1661 SDYLTGILLVKHSQSQLLPVCDAGHIKLWEGYSLLFTDGDERAHSQDLGKSETYIAIDSK 1720
Query: 210 -------------YAGSCVRKFSTMPFLFCDLNDVCNYASRNDRSYWLSTGQPIPMM 341
YAGSCVRKFSTMPFLFCD+N+VC+Y +R DRSYWLST PIPMM
Sbjct: 1721 FFPRFSYDLVPFRYAGSCVRKFSTMPFLFCDINNVCHYGNRGDRSYWLSTTSPIPMM 1777
Score = 56.8 bits (131), Expect = 9e-08
Identities = 31/91 (34%), Positives = 46/91 (50%), Gaps = 2/91 (2%)
Frame = +3
Query: 54 CATTDYLTGILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVR 230
C + +L V HSQ +P C G LW GYS L++ + Q L +GSC+
Sbjct: 1791 CVVCEVPANVLAV-HSQSLNIPDCPQGWTGLWIGYSFLMHTGAGAQGGGQSLSSSGSCLE 1849
Query: 231 KFSTMPFLFCDLN-DVCNYASRNDRSYWLST 320
F PF+ C+ N C+Y N+ S+W++T
Sbjct: 1850 DFRATPFIECNGNKGQCHY-YMNEISFWMAT 1879
>UniRef50_Q4S0I4 Cluster: Chromosome 2 SCAF14781, whole genome shotgun
sequence; n=5; Euteleostomi|Rep: Chromosome 2 SCAF14781,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 1468
Score = 143 bits (346), Expect = 8e-34
Identities = 60/88 (68%), Positives = 71/88 (80%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G LLV+HSQ E +P C G KLW GYSLLY++G EKAHNQDLG AGSC+ +FSTMPFL+
Sbjct: 1276 GYLLVKHSQTEQIPMCPVGMAKLWSGYSLLYMEGQEKAHNQDLGLAGSCLPRFSTMPFLY 1335
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPIPMM 341
C+ D+C YASRND+SYWLST P+PMM
Sbjct: 1336 CNPGDICYYASRNDKSYWLSTTAPLPMM 1363
Score = 59.3 bits (137), Expect = 2e-08
Identities = 30/82 (36%), Positives = 43/82 (52%), Gaps = 2/82 (2%)
Frame = +3
Query: 81 ILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
+ + HSQ +PQC G LW GYS L++ + Q L GSC+ F T PF+
Sbjct: 1385 VAIAVHSQDITIPQCPVGWRSLWIGYSFLMHTAAGNEGGGQSLSSPGSCLEDFRTTPFIE 1444
Query: 258 CD-LNDVCNYASRNDRSYWLST 320
C+ C+Y + N S+WLS+
Sbjct: 1445 CNGAKGTCHYFA-NKHSFWLSS 1465
>UniRef50_P08120 Cluster: Collagen alpha-1(IV) chain precursor; n=5;
Diptera|Rep: Collagen alpha-1(IV) chain precursor -
Drosophila melanogaster (Fruit fly)
Length = 1775
Score = 142 bits (343), Expect = 2e-33
Identities = 62/95 (65%), Positives = 73/95 (76%)
Frame = +3
Query: 57 ATTDYLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKF 236
A DYLTGIL+ RHSQ E VP C GH +LW GYSLLY+DGN+ AHNQDL GSCV +F
Sbjct: 1547 AALDYLTGILITRHSQSETVPACSAGHTELWTGYSLLYVDGNDYAHNQDL---GSCVPRF 1603
Query: 237 STMPFLFCDLNDVCNYASRNDRSYWLSTGQPIPMM 341
ST+P L C N+VCNYASRND+++WL+T IPMM
Sbjct: 1604 STLPVLSCGQNNVCNYASRNDKTFWLTTNAAIPMM 1638
Score = 42.7 bits (96), Expect = 0.002
Identities = 27/89 (30%), Positives = 39/89 (43%), Gaps = 2/89 (2%)
Frame = +3
Query: 54 CATTDYLTGILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVR 230
C + ++ V HSQ VP C G LW GYS L++ Q L GSC+
Sbjct: 1652 CVVCEAPANVIAV-HSQTIEVPDCPNGWEGLWIGYSFLMHTAVGNGGGGQALQSPGSCLE 1710
Query: 231 KFSTMPFLFCD-LNDVCNYASRNDRSYWL 314
F PF+ C+ C++ S+W+
Sbjct: 1711 DFRATPFIECNGAKGTCHF-YETMTSFWM 1738
>UniRef50_UPI0000DA44CD Cluster: PREDICTED: similar to procollagen,
type IV, alpha 6; n=1; Rattus norvegicus|Rep: PREDICTED:
similar to procollagen, type IV, alpha 6 - Rattus
norvegicus
Length = 1405
Score = 140 bits (338), Expect = 7e-33
Identities = 58/88 (65%), Positives = 73/88 (82%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G LV+HSQ E VP C G +LW GYSLL+++G EKAHNQDLG+AGSC+ +FSTMPF++
Sbjct: 1248 GYTLVKHSQSEHVPPCPIGMSQLWVGYSLLFVEGQEKAHNQDLGFAGSCLPRFSTMPFIY 1307
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPIPMM 341
C++N+VC+YA RND+SYWLST PIPMM
Sbjct: 1308 CNINEVCHYARRNDKSYWLSTTAPIPMM 1335
>UniRef50_UPI0000DBF028 Cluster: UPI0000DBF028 related cluster; n=9;
Rattus norvegicus|Rep: UPI0000DBF028 UniRef100 entry -
Rattus norvegicus
Length = 1549
Score = 140 bits (338), Expect = 7e-33
Identities = 58/88 (65%), Positives = 73/88 (82%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G LV+HSQ E VP C G +LW GYSLL+++G EKAHNQDLG+AGSC+ +FSTMPF++
Sbjct: 1412 GYTLVKHSQSEHVPPCPIGMSQLWVGYSLLFVEGQEKAHNQDLGFAGSCLPRFSTMPFIY 1471
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPIPMM 341
C++N+VC+YA RND+SYWLST PIPMM
Sbjct: 1472 CNINEVCHYARRNDKSYWLSTTAPIPMM 1499
>UniRef50_UPI0000613E3C Cluster: Collagen alpha-2(IV) chain; n=2;
Bos taurus|Rep: Collagen alpha-2(IV) chain - Bos Taurus
Length = 227
Score = 139 bits (336), Expect = 1e-32
Identities = 60/88 (68%), Positives = 69/88 (78%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G LLV+HSQ + P C G KLW GYSLLY +G EKAHNQDLG AGSC+ +FSTMPFL+
Sbjct: 4 GYLLVKHSQTDKEPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLY 63
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPIPMM 341
C+ DVC YASRND+SYWLST P+PMM
Sbjct: 64 CNPGDVCYYASRNDKSYWLSTTAPLPMM 91
Score = 54.4 bits (125), Expect = 5e-07
Identities = 28/82 (34%), Positives = 42/82 (51%), Gaps = 2/82 (2%)
Frame = +3
Query: 81 ILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
+ + HSQ +P C G LW GYS L++ ++ Q L GSC+ F PF+
Sbjct: 113 VAIAVHSQDVSIPHCPAGWRSLWIGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIE 172
Query: 258 CD-LNDVCNYASRNDRSYWLST 320
C+ C+Y + N S+WL+T
Sbjct: 173 CNGARGTCHYYA-NKYSFWLTT 193
>UniRef50_P08572 Cluster: Collagen alpha-2(IV) chain precursor
[Contains: Canstatin]; n=48; Tetrapoda|Rep: Collagen
alpha-2(IV) chain precursor [Contains: Canstatin] - Homo
sapiens (Human)
Length = 1712
Score = 139 bits (336), Expect = 1e-32
Identities = 60/88 (68%), Positives = 69/88 (78%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G LLV+HSQ + P C G KLW GYSLLY +G EKAHNQDLG AGSC+ +FSTMPFL+
Sbjct: 1489 GYLLVKHSQTDQEPMCPVGMNKLWSGYSLLYFEGQEKAHNQDLGLAGSCLARFSTMPFLY 1548
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPIPMM 341
C+ DVC YASRND+SYWLST P+PMM
Sbjct: 1549 CNPGDVCYYASRNDKSYWLSTTAPLPMM 1576
Score = 53.6 bits (123), Expect = 8e-07
Identities = 29/82 (35%), Positives = 42/82 (51%), Gaps = 2/82 (2%)
Frame = +3
Query: 81 ILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
I + HSQ +P C G LW GYS L++ ++ Q L GSC+ F PF+
Sbjct: 1598 IAIAVHSQDVSIPHCPAGWRSLWIGYSFLMHTAAGDEGGGQSLVSPGSCLEDFRATPFIE 1657
Query: 258 CD-LNDVCNYASRNDRSYWLST 320
C+ C+Y + N S+WL+T
Sbjct: 1658 CNGGRGTCHYYA-NKYSFWLTT 1678
>UniRef50_Q4SZ69 Cluster: Chromosome undetermined SCAF11805, whole
genome shotgun sequence; n=2; Tetraodon
nigroviridis|Rep: Chromosome undetermined SCAF11805,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 471
Score = 138 bits (334), Expect = 2e-32
Identities = 57/87 (65%), Positives = 72/87 (82%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G L+ RHSQ + VP C G ++DGYSLLY+ GNE+AH QDLG AGSC+R+FSTMPF+F
Sbjct: 207 GFLITRHSQAQDVPYCPDGTNLIYDGYSLLYVQGNERAHGQDLGTAGSCLRRFSTMPFMF 266
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPIPM 338
C++N+VCN+ASRND SYWLST +P+PM
Sbjct: 267 CNINNVCNFASRNDYSYWLSTPEPMPM 293
Score = 41.9 bits (94), Expect(2) = 2e-06
Identities = 19/47 (40%), Positives = 26/47 (55%)
Frame = +3
Query: 198 QDLGYAGSCVRKFSTMPFLFCDLNDVCNYASRNDRSYWLSTGQPIPM 338
Q L GSC+ +F + PF+ C CNY N S+WL+T +P M
Sbjct: 398 QALASPGSCLEEFRSAPFIECHGRGTCNYYG-NSYSFWLATVEPSEM 443
Score = 29.9 bits (64), Expect(2) = 2e-06
Identities = 15/49 (30%), Positives = 22/49 (44%)
Frame = +3
Query: 54 CATTDYLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQ 200
CA + ++ V HSQ +P C LW GYS + + + H Q
Sbjct: 310 CAVCEAPAMVIAV-HSQTIQIPTCPANWEALWIGYSFMMVGRDTHTHIQ 357
>UniRef50_Q4SZ73 Cluster: Chromosome undetermined SCAF11805, whole
genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome undetermined SCAF11805, whole genome shotgun
sequence - Tetraodon nigroviridis (Green puffer)
Length = 1026
Score = 137 bits (331), Expect = 5e-32
Identities = 59/88 (67%), Positives = 68/88 (77%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G LV+HSQ VP C G KLWDGYSLLY++G EKAHNQDLG GSC+ +FST+PFL+
Sbjct: 804 GYTLVKHSQDAQVPMCPQGMAKLWDGYSLLYVEGQEKAHNQDLGQPGSCLPRFSTIPFLY 863
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPIPMM 341
C N+VC YASRND+SYWLST IPMM
Sbjct: 864 CSPNEVCYYASRNDKSYWLSTTASIPMM 891
Score = 58.0 bits (134), Expect = 4e-08
Identities = 30/80 (37%), Positives = 41/80 (51%), Gaps = 2/80 (2%)
Frame = +3
Query: 96 HSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD-LN 269
HSQ +P C PG LW GYS L++ + Q L GSC+ F PF+ C+
Sbjct: 918 HSQDMTIPTCPPGWRSLWIGYSFLMHTAAGAEGGGQSLVSPGSCLEDFRATPFIECNGAK 977
Query: 270 DVCNYASRNDRSYWLSTGQP 329
C+Y + N S+WL+T P
Sbjct: 978 GTCHYFA-NKYSFWLTTVDP 996
>UniRef50_P29400 Cluster: Collagen alpha-5(IV) chain precursor; n=61;
Eumetazoa|Rep: Collagen alpha-5(IV) chain precursor -
Homo sapiens (Human)
Length = 1685
Score = 136 bits (329), Expect = 9e-32
Identities = 55/93 (59%), Positives = 75/93 (80%)
Frame = +3
Query: 60 TTDYLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFS 239
T+ G L+ RHSQ PQC G +++++G+SLLY+ GN++AH QDLG AGSC+R+FS
Sbjct: 1455 TSSVAHGFLITRHSQTTDAPQCPQGTLQVYEGFSLLYVQGNKRAHGQDLGTAGSCLRRFS 1514
Query: 240 TMPFLFCDLNDVCNYASRNDRSYWLSTGQPIPM 338
TMPF+FC++N+VCN+ASRND SYWLST +P+PM
Sbjct: 1515 TMPFMFCNINNVCNFASRNDYSYWLSTPEPMPM 1547
Score = 61.3 bits (142), Expect = 4e-09
Identities = 30/90 (33%), Positives = 46/90 (51%), Gaps = 1/90 (1%)
Frame = +3
Query: 54 CATTDYLTGILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVR 230
CA + +++ HSQ +P C G LW GYS +++ + Q L GSC+
Sbjct: 1564 CAVCE-APAVVIAVHSQTIQIPHCPQGWDSLWIGYSFMMHTSAGAEGSGQALASPGSCLE 1622
Query: 231 KFSTMPFLFCDLNDVCNYASRNDRSYWLST 320
+F + PF+ C CNY + N S+WL+T
Sbjct: 1623 EFRSAPFIECHGRGTCNYYA-NSYSFWLAT 1651
>UniRef50_Q07265 Cluster: 3 alpha procollagen; n=4; Strongylocentrotus
purpuratus|Rep: 3 alpha procollagen - Strongylocentrotus
purpuratus (Purple sea urchin)
Length = 1752
Score = 134 bits (323), Expect = 5e-31
Identities = 53/88 (60%), Positives = 69/88 (78%)
Frame = +3
Query: 75 TGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFL 254
+G + RHSQ +PQC G K+W GYSLL++ GNE+ H QDLG GSC+++FSTMPFL
Sbjct: 1527 SGFFITRHSQTTSIPQCPQGTAKMWHGYSLLFVQGNERGHGQDLGKPGSCLKRFSTMPFL 1586
Query: 255 FCDLNDVCNYASRNDRSYWLSTGQPIPM 338
FC++N+VC+ ASRND SYWLST +P+PM
Sbjct: 1587 FCNINNVCHVASRNDYSYWLSTTEPMPM 1614
Score = 49.2 bits (112), Expect = 2e-05
Identities = 27/80 (33%), Positives = 39/80 (48%), Gaps = 1/80 (1%)
Frame = +3
Query: 84 LLVRHSQREVVPQCEPGHVKLWDGYSLLYIDG-NEKAHNQDLGYAGSCVRKFSTMPFLFC 260
+L HSQ +P C LW GYS + G + Q L GSC+ F + PF+ C
Sbjct: 1640 VLTVHSQTVNIPDCPDRWGVLWIGYSFMMHTGPGGEGSGQMLSSPGSCLEDFRSSPFIEC 1699
Query: 261 DLNDVCNYASRNDRSYWLST 320
+ CNY + ++WLS+
Sbjct: 1700 HGDGKCNYYA-TTYTFWLSS 1718
>UniRef50_P53420 Cluster: Collagen alpha-4(IV) chain precursor; n=36;
Euteleostomi|Rep: Collagen alpha-4(IV) chain precursor -
Homo sapiens (Human)
Length = 1690
Score = 134 bits (323), Expect = 5e-31
Identities = 56/91 (61%), Positives = 70/91 (76%)
Frame = +3
Query: 69 YLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMP 248
YL G LLV HSQ + P C G +LW GYSLLY++G EKAHNQDLG AGSC+ FST+P
Sbjct: 1462 YLGGFLLVLHSQTDQEPTCPLGMPRLWTGYSLLYLEGQEKAHNQDLGLAGSCLPVFSTLP 1521
Query: 249 FLFCDLNDVCNYASRNDRSYWLSTGQPIPMM 341
F +C+++ VC+YA RNDRSYWL++ P+PMM
Sbjct: 1522 FAYCNIHQVCHYAQRNDRSYWLASAAPLPMM 1552
Score = 50.0 bits (114), Expect = 1e-05
Identities = 27/77 (35%), Positives = 39/77 (50%), Gaps = 2/77 (2%)
Frame = +3
Query: 96 HSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD-LN 269
HSQ + +P C LW GYS L++ ++ Q L GSC+ F PFL C
Sbjct: 1579 HSQDQSIPPCPQTWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDFRAAPFLECQGRQ 1638
Query: 270 DVCNYASRNDRSYWLST 320
C++ + N S+WL+T
Sbjct: 1639 GTCHFFA-NKYSFWLTT 1654
>UniRef50_Q26640 Cluster: Alpha2(IV)-like collagen; n=4;
Strongylocentrotus purpuratus|Rep: Alpha2(IV)-like
collagen - Strongylocentrotus purpuratus (Purple sea
urchin)
Length = 1747
Score = 133 bits (322), Expect = 6e-31
Identities = 56/88 (63%), Positives = 70/88 (79%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G + RHSQ VP C G V+LW G+S+L+ GN AH+QDLG AGSC+++FSTMPFLF
Sbjct: 1526 GHFITRHSQSRNVPSCPAGTVELWRGFSVLFSMGNGHAHHQDLGDAGSCLQRFSTMPFLF 1585
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPIPMM 341
C+ N+VCNYASRNDRSYWL+T +P+PMM
Sbjct: 1586 CNFNNVCNYASRNDRSYWLTTNEPLPMM 1613
Score = 59.7 bits (138), Expect = 1e-08
Identities = 28/77 (36%), Positives = 39/77 (50%)
Frame = +3
Query: 87 LVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCDL 266
L HSQ + +PQC G LW GYS + Q L GSC+ F PF+ C+
Sbjct: 1637 LAIHSQSQEIPQCPGGWRSLWTGYSFTMYTAASEGGGQGLESVGSCLENFRATPFIECNG 1696
Query: 267 NDVCNYASRNDRSYWLS 317
C++ S N+ S+WL+
Sbjct: 1697 RGNCHFFS-NEYSFWLT 1712
>UniRef50_P55787 Cluster: Collagen alpha-4(IV) chain; n=46;
Eumetazoa|Rep: Collagen alpha-4(IV) chain - Oryctolagus
cuniculus (Rabbit)
Length = 623
Score = 133 bits (322), Expect = 6e-31
Identities = 55/91 (60%), Positives = 71/91 (78%)
Frame = +3
Query: 69 YLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMP 248
YL+G LLV HSQ + P C G +LW GYSLLY++G EKAHNQDLG AGSC+ FST+P
Sbjct: 395 YLSGFLLVLHSQTDQEPACPMGMPRLWTGYSLLYLEGQEKAHNQDLGLAGSCLPIFSTLP 454
Query: 249 FLFCDLNDVCNYASRNDRSYWLSTGQPIPMM 341
F +C+++ VC+YA RND+SYWL++ P+PMM
Sbjct: 455 FAYCNIHQVCHYAQRNDKSYWLASAGPLPMM 485
Score = 52.8 bits (121), Expect = 1e-06
Identities = 28/80 (35%), Positives = 41/80 (51%), Gaps = 2/80 (2%)
Frame = +3
Query: 96 HSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD-LN 269
HSQ + +P C LW GYS L++ ++ Q L GSC+ F PFL C
Sbjct: 512 HSQDQSIPPCPRAWRSLWIGYSFLMHTGAGDQGGGQALMSPGSCLEDFRAAPFLECQGRQ 571
Query: 270 DVCNYASRNDRSYWLSTGQP 329
C++ + N+ S+WL+T P
Sbjct: 572 GTCHFFA-NEYSFWLTTVPP 590
>UniRef50_UPI00015A592A Cluster: Type IV collagen alpha 4 chain; n=3;
Danio rerio|Rep: Type IV collagen alpha 4 chain - Danio
rerio
Length = 1639
Score = 127 bits (307), Expect = 4e-29
Identities = 56/87 (64%), Positives = 66/87 (75%)
Frame = +3
Query: 75 TGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFL 254
TG LLV HSQ VP C G +LW+GYSLLY++G E+AH QDLG AGSC+ FSTMPF
Sbjct: 1416 TGFLLVMHSQSRYVPTCPAGLTQLWNGYSLLYLEGQERAHTQDLGQAGSCLPVFSTMPFS 1475
Query: 255 FCDLNDVCNYASRNDRSYWLSTGQPIP 335
C++ D C+YASRND+SYWLST PIP
Sbjct: 1476 CCNM-DTCDYASRNDKSYWLSTNAPIP 1501
Score = 48.4 bits (110), Expect = 3e-05
Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 2/76 (2%)
Frame = +3
Query: 96 HSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD-LN 269
HSQ + P C P LW G+S ++Y ++ Q L GSC++ F + PF+ C
Sbjct: 1530 HSQDRLDPVCPPKWRSLWTGFSFMMYTGSGDEGGGQSLTSTGSCLQDFRSQPFVECQGPR 1589
Query: 270 DVCNYASRNDRSYWLS 317
C+Y + + S+W++
Sbjct: 1590 GTCSYFA-SIYSFWMT 1604
>UniRef50_Q4TZW9 Cluster: Type IV collagen alpha 4 chain; n=3; Danio
rerio|Rep: Type IV collagen alpha 4 chain - Danio rerio
(Zebrafish) (Brachydanio rerio)
Length = 240
Score = 127 bits (307), Expect = 4e-29
Identities = 56/87 (64%), Positives = 66/87 (75%)
Frame = +3
Query: 75 TGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFL 254
TG LLV HSQ VP C G +LW+GYSLLY++G E+AH QDLG AGSC+ FSTMPF
Sbjct: 14 TGFLLVMHSQSRYVPTCPAGLTQLWNGYSLLYLEGQERAHTQDLGQAGSCLPVFSTMPFS 73
Query: 255 FCDLNDVCNYASRNDRSYWLSTGQPIP 335
C++ D C+YASRND+SYWLST PIP
Sbjct: 74 CCNM-DTCDYASRNDKSYWLSTNAPIP 99
Score = 48.4 bits (110), Expect = 3e-05
Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 2/76 (2%)
Frame = +3
Query: 96 HSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD-LN 269
HSQ + P C P LW G+S ++Y ++ Q L GSC++ F + PF+ C
Sbjct: 128 HSQDRLDPVCPPKWRNLWTGFSFMMYTGSGDEGGGQSLTSTGSCLQDFRSQPFVECQGPR 187
Query: 270 DVCNYASRNDRSYWLS 317
C+Y + + S+W++
Sbjct: 188 GTCSYFA-SIYSFWMT 202
>UniRef50_Q28084 Cluster: Collagen alpha-3(IV) chain; n=13; cellular
organisms|Rep: Collagen alpha-3(IV) chain - Bos taurus
(Bovine)
Length = 471
Score = 126 bits (305), Expect = 7e-29
Identities = 52/89 (58%), Positives = 67/89 (75%)
Frame = +3
Query: 72 LTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPF 251
+ G + RHSQ +P C G L+ G+SLL++ GNE+AH QDLG GSC+++F+TMPF
Sbjct: 244 MRGFVFTRHSQTTAIPSCPEGTEPLYSGFSLLFVQGNEQAHGQDLGTLGSCLQRFTTMPF 303
Query: 252 LFCDLNDVCNYASRNDRSYWLSTGQPIPM 338
LFC++NDVCN+ASRND SYWLST IPM
Sbjct: 304 LFCNINDVCNFASRNDYSYWLSTPAMIPM 332
Score = 62.9 bits (146), Expect = 1e-09
Identities = 29/84 (34%), Positives = 43/84 (51%), Gaps = 1/84 (1%)
Frame = +3
Query: 81 ILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
I + HSQ +P C G + LW G+S +++ + Q L GSC+ +F PF+
Sbjct: 357 IAIAVHSQTTDIPPCPAGWISLWKGFSFIMFTSAGSEGAGQALASPGSCLEEFRASPFIE 416
Query: 258 CDLNDVCNYASRNDRSYWLSTGQP 329
C CNY S N S+WL++ P
Sbjct: 417 CHGRGTCNYYS-NSYSFWLASLDP 439
>UniRef50_P17139-2 Cluster: Isoform b of P17139 ; n=2; Caenorhabditis
elegans|Rep: Isoform b of P17139 - Caenorhabditis elegans
Length = 1502
Score = 125 bits (301), Expect = 2e-28
Identities = 52/89 (58%), Positives = 67/89 (75%), Gaps = 1/89 (1%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G +HSQ VPQC PG +LW+GYSLLY+ GN +A QDLG GSC+ KF+TMPF+F
Sbjct: 1278 GFTFAKHSQTTAVPQCPPGASQLWEGYSLLYVQGNGRASGQDLGQPGSCLSKFNTMPFMF 1337
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPI-PMM 341
C++N VC+ +SRND S+WLST +P+ PMM
Sbjct: 1338 CNMNSVCHVSSRNDYSFWLSTDEPMTPMM 1366
Score = 60.5 bits (140), Expect = 7e-09
Identities = 32/89 (35%), Positives = 45/89 (50%), Gaps = 1/89 (1%)
Frame = +3
Query: 54 CATTDYLTGILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVR 230
CA + T I+ V HSQ VPQC G +W GYS +++ + Q L GSC+
Sbjct: 1381 CAVCEVPTQIIAV-HSQDTSVPQCPQGWSGMWTGYSFVMHTAAGAEGTGQSLQSPGSCLE 1439
Query: 231 KFSTMPFLFCDLNDVCNYASRNDRSYWLS 317
+F +PF+ C CNY + N +W S
Sbjct: 1440 EFRAVPFIECHGRGTCNYYATN-HGFWPS 1467
>UniRef50_Q01955 Cluster: Collagen alpha-3(IV) chain precursor
(Goodpasture antigen) [Contains: Tumstatin]; n=61;
Eumetazoa|Rep: Collagen alpha-3(IV) chain precursor
(Goodpasture antigen) [Contains: Tumstatin] - Homo
sapiens (Human)
Length = 1670
Score = 124 bits (299), Expect = 4e-28
Identities = 50/87 (57%), Positives = 66/87 (75%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G + RHSQ +P C G V L+ G+S L++ GN++AH QDLG GSC+++F+TMPFLF
Sbjct: 1445 GFVFTRHSQTTAIPSCPEGTVPLYSGFSFLFVQGNQRAHGQDLGTLGSCLQRFTTMPFLF 1504
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPIPM 338
C++NDVCN+ASRND SYWLST +PM
Sbjct: 1505 CNVNDVCNFASRNDYSYWLSTPALMPM 1531
Score = 63.3 bits (147), Expect = 1e-09
Identities = 30/84 (35%), Positives = 43/84 (51%), Gaps = 1/84 (1%)
Frame = +3
Query: 81 ILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
I + HSQ +P C G + LW G+S +++ + Q L GSC+ +F PFL
Sbjct: 1556 IAIAVHSQTTDIPPCPHGWISLWKGFSFIMFTSAGSEGTGQALASPGSCLEEFRASPFLE 1615
Query: 258 CDLNDVCNYASRNDRSYWLSTGQP 329
C CNY S N S+WL++ P
Sbjct: 1616 CHGRGTCNYYS-NSYSFWLASLNP 1638
>UniRef50_Q7PVR6 Cluster: ENSANGP00000016652; n=3;
Endopterygota|Rep: ENSANGP00000016652 - Anopheles
gambiae str. PEST
Length = 461
Score = 121 bits (292), Expect = 3e-27
Identities = 49/87 (56%), Positives = 66/87 (75%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G L RHSQ+ +P+C KLWDGYSL+ + + ++ QDLG AGSC+R+FSTMPF+F
Sbjct: 184 GYLFARHSQKVTIPECPINTYKLWDGYSLVNVIASSRSVGQDLGAAGSCLRRFSTMPFMF 243
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPIPM 338
CD+N+VCNYAS ND + WL+T +P+PM
Sbjct: 244 CDINNVCNYASNNDDTIWLATPEPMPM 270
Score = 52.0 bits (119), Expect = 3e-06
Identities = 27/89 (30%), Positives = 43/89 (48%), Gaps = 1/89 (1%)
Frame = +3
Query: 54 CATTDYLTGILLVRHSQREVVPQCEPGHVKLWDGYSL-LYIDGNEKAHNQDLGYAGSCVR 230
C+ + T ++ + HSQ +P C G +LW GYS ++ N QD GSC+
Sbjct: 287 CSVCESNTRVMAL-HSQSMSIPDCPEGWEELWLGYSYAMHTSDNSGGFGQDFVSPGSCME 345
Query: 231 KFSTMPFLFCDLNDVCNYASRNDRSYWLS 317
+F P + C + CN+ S+WL+
Sbjct: 346 EFRPQPVIECHGHGTCNFYD-GISSFWLT 373
>UniRef50_UPI00015B49AB Cluster: PREDICTED: similar to alpha-5 type IV
collagen; n=1; Nasonia vitripennis|Rep: PREDICTED:
similar to alpha-5 type IV collagen - Nasonia vitripennis
Length = 1702
Score = 118 bits (285), Expect = 2e-26
Identities = 49/87 (56%), Positives = 64/87 (73%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G RHSQ ++P C VK+WDG+SLL++ GN AH QDLG GSC++KFS MPF
Sbjct: 1374 GFYFARHSQSAMIPVCPRNTVKMWDGFSLLHVMGNSYAHAQDLGTPGSCLKKFSVMPFNV 1433
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPIPM 338
C+LN+VC+YA+RND SYWLS+ + +PM
Sbjct: 1434 CNLNNVCDYANRNDYSYWLSSNEQMPM 1460
Score = 63.7 bits (148), Expect = 8e-10
Identities = 31/80 (38%), Positives = 43/80 (53%), Gaps = 1/80 (1%)
Frame = +3
Query: 84 LLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFC 260
L+V HSQ +P+C G +LW GYS L++ D Q L GSC+ +F PF+ C
Sbjct: 1486 LIVMHSQSMAIPECPGGWEELWAGYSFLMHRDAGAAGGGQPLSSPGSCLEEFRARPFIEC 1545
Query: 261 DLNDVCNYASRNDRSYWLST 320
CN+ S SYW++T
Sbjct: 1546 RGQGTCNFFS-TAVSYWMAT 1564
>UniRef50_UPI000065E567 Cluster: Homolog of Brachydanio rerio
"Collagen, type I, alpha 3.; n=1; Takifugu rubripes|Rep:
Homolog of Brachydanio rerio "Collagen, type I, alpha 3.
- Takifugu rubripes
Length = 1426
Score = 117 bits (281), Expect = 6e-26
Identities = 53/88 (60%), Positives = 63/88 (71%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G LLV HSQ VP+C G LW GYSL Y+ G + AH QDLG AGSC+R FSTMPF +
Sbjct: 1205 GFLLVIHSQSVQVPKCPDGSSLLWVGYSLAYLKGQKNAHAQDLGQAGSCLRVFSTMPFSY 1264
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPIPMM 341
C+ C+++SRND+SYWLST PIPMM
Sbjct: 1265 CN-KAACHFSSRNDKSYWLSTAAPIPMM 1291
Score = 57.6 bits (133), Expect = 5e-08
Identities = 31/83 (37%), Positives = 43/83 (51%), Gaps = 2/83 (2%)
Frame = +3
Query: 87 LVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD 263
+V HSQ P C G LW GYS L++ ++ Q L +GSC++ F T P + C
Sbjct: 1315 VVFHSQEHTAPACPQGWRSLWTGYSFLMHTGAGDEGSGQALTSSGSCLKNFQTHPIIECQ 1374
Query: 264 -LNDVCNYASRNDRSYWLSTGQP 329
C+Y S N S+WL+T P
Sbjct: 1375 GPQGSCHYFS-NLYSFWLTTISP 1396
>UniRef50_UPI00006608B5 Cluster: Homolog of Homo sapiens "Tumstatin;
n=1; Takifugu rubripes|Rep: Homolog of Homo sapiens
"Tumstatin - Takifugu rubripes
Length = 1374
Score = 116 bits (280), Expect = 8e-26
Identities = 48/93 (51%), Positives = 63/93 (67%)
Frame = +3
Query: 54 CATTDYLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRK 233
C + L RHSQ +P+C G +++ GYSLL+I+GN +AH QDLG GSC+ +
Sbjct: 1143 CIDAPHQDSFLFTRHSQELSIPECPVGSTEVYSGYSLLFINGNNRAHGQDLGTLGSCLPR 1202
Query: 234 FSTMPFLFCDLNDVCNYASRNDRSYWLSTGQPI 332
F+TMPFLFC+ + C YASRND SYWLST Q +
Sbjct: 1203 FTTMPFLFCNTDSTCRYASRNDYSYWLSTNQVV 1235
Score = 65.7 bits (153), Expect = 2e-10
Identities = 34/96 (35%), Positives = 49/96 (51%), Gaps = 1/96 (1%)
Frame = +3
Query: 54 CATTDYLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDG-NEKAHNQDLGYAGSCVR 230
C+ + T ++ + HSQ VVP C G + LW GYS + G + Q L GSC+
Sbjct: 1254 CSVCETRTNVIAI-HSQTSVVPDCPLGWLPLWVGYSFVMETGVGAEGSGQPLASPGSCLE 1312
Query: 231 KFSTMPFLFCDLNDVCNYASRNDRSYWLSTGQPIPM 338
+F +PF+ C CNY + + SYWL+ P M
Sbjct: 1313 QFRKIPFIECHGRGTCNYYT-DSYSYWLAALSPHDM 1347
>UniRef50_Q58FS7 Cluster: Type IV collagen alpha 3 chain; n=2; Danio
rerio|Rep: Type IV collagen alpha 3 chain - Danio rerio
(Zebrafish) (Brachydanio rerio)
Length = 244
Score = 114 bits (275), Expect = 3e-25
Identities = 47/85 (55%), Positives = 61/85 (71%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G L RHSQ V+P+C G +L+ GYSLL+I+GN + H QDLG GSC+ F+TMPF+
Sbjct: 17 GFLFTRHSQTTVIPECPAGSKRLYTGYSLLFINGNNRGHGQDLGTLGSCLPMFNTMPFMV 76
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPI 332
C+ ++ C YASRND SYWLST P+
Sbjct: 77 CNRDETCRYASRNDYSYWLSTDTPM 101
Score = 60.9 bits (141), Expect = 5e-09
Identities = 29/90 (32%), Positives = 48/90 (53%), Gaps = 1/90 (1%)
Frame = +3
Query: 54 CATTDYLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDG-NEKAHNQDLGYAGSCVR 230
C+ + + ++ + HSQ +PQC G + LW+GYS + G + Q L GSC+
Sbjct: 120 CSVCEAIANVIAI-HSQTINIPQCPVGWLSLWEGYSFVMQTGVGAEGSGQPLVSPGSCLE 178
Query: 231 KFSTMPFLFCDLNDVCNYASRNDRSYWLST 320
+F +PF+ C CN+ + SYWL++
Sbjct: 179 QFRKIPFIECHGRGTCNFYP-DSYSYWLAS 207
>UniRef50_Q4SB07 Cluster: Chromosome undetermined SCAF14677, whole
genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep:
Chromosome undetermined SCAF14677, whole genome shotgun
sequence - Tetraodon nigroviridis (Green puffer)
Length = 856
Score = 114 bits (274), Expect = 4e-25
Identities = 47/93 (50%), Positives = 62/93 (66%)
Frame = +3
Query: 54 CATTDYLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRK 233
C L RHSQ +P+C G +++ GYSLL+I+GN +AH QDLG GSC+ +
Sbjct: 727 CTDAPQQDSFLFTRHSQELYIPECPAGSTQVYSGYSLLFINGNNRAHGQDLGTLGSCLPR 786
Query: 234 FSTMPFLFCDLNDVCNYASRNDRSYWLSTGQPI 332
F+TMPFLFC+ + C YASRND SYWLST + +
Sbjct: 787 FTTMPFLFCNTDRTCRYASRNDYSYWLSTNKMV 819
>UniRef50_Q9GQB1 Cluster: Type IV collagen alpha 1 chain precursor;
n=1; Hydra vulgaris|Rep: Type IV collagen alpha 1 chain
precursor - Hydra attenuata (Hydra) (Hydra vulgaris)
Length = 1723
Score = 113 bits (272), Expect = 7e-25
Identities = 48/83 (57%), Positives = 59/83 (71%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G LV+HSQ VP C G +W+GYS LY GNE+A QDLG GSC+++FSTMPFLF
Sbjct: 1501 GFYLVKHSQSIKVPSCPAGMQTMWEGYSFLYAQGNERAFGQDLGQPGSCLKRFSTMPFLF 1560
Query: 258 CDLNDVCNYASRNDRSYWLSTGQ 326
CD+ + C ASRND S+WLST +
Sbjct: 1561 CDIQNKCVVASRNDYSFWLSTAE 1583
Score = 55.2 bits (127), Expect = 3e-07
Identities = 29/80 (36%), Positives = 40/80 (50%), Gaps = 1/80 (1%)
Frame = +3
Query: 84 LLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFC 260
+L HSQ E+ P+C G LW G+S L+Y + Q L +GSC+ F P++ C
Sbjct: 1611 VLAVHSQSELDPKCPDGWENLWTGFSFLMYNSAGAQGSGQLLSSSGSCLEDFRVNPYIEC 1670
Query: 261 DLNDVCNYASRNDRSYWLST 320
C Y S+WLST
Sbjct: 1671 HGRGTCWYYGPT-LSFWLST 1689
>UniRef50_A7T3G2 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 590
Score = 109 bits (261), Expect = 2e-23
Identities = 43/76 (56%), Positives = 59/76 (77%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G +V+HSQ P+C P + KLWDGYSLLY+ G++ +H QDLG AGSC+++F+TMP+L+
Sbjct: 475 GFYIVKHSQTTTPPECPPTYDKLWDGYSLLYVQGHDVSHGQDLGQAGSCLKRFTTMPYLY 534
Query: 258 CDLNDVCNYASRNDRS 305
C++ CNYASRND S
Sbjct: 535 CNIFGKCNYASRNDYS 550
>UniRef50_A7T795 Cluster: Predicted protein; n=1; Nematostella
vectensis|Rep: Predicted protein - Nematostella
vectensis
Length = 331
Score = 108 bits (259), Expect = 3e-23
Identities = 42/74 (56%), Positives = 58/74 (78%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G +V+HSQ P+C P + KLWDGYSLLY+ G++ +H QDLG AGSC+++F+TMP+L+
Sbjct: 256 GFYIVKHSQTTTPPECPPTYDKLWDGYSLLYVQGHDVSHGQDLGQAGSCLKRFTTMPYLY 315
Query: 258 CDLNDVCNYASRND 299
C++ CNYASRND
Sbjct: 316 CNIFGKCNYASRND 329
>UniRef50_UPI000065E566 Cluster: Homolog of Homo sapiens "Splice
Isoform 1 of Collagen alpha 3; n=1; Takifugu
rubripes|Rep: Homolog of Homo sapiens "Splice Isoform 1
of Collagen alpha 3 - Takifugu rubripes
Length = 1258
Score = 104 bits (250), Expect = 3e-22
Identities = 45/84 (53%), Positives = 56/84 (66%)
Frame = +3
Query: 84 LLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCD 263
++ RHSQ +P C G L+ GYS L++ N++ H QDLG GSC+ FSTMPFL CD
Sbjct: 1039 MIARHSQSIHIPVCPCGTSLLFSGYSFLFMHANDRVHGQDLGTPGSCLPHFSTMPFLVCD 1098
Query: 264 LNDVCNYASRNDRSYWLSTGQPIP 335
C YASRND SYWLSTG+ +P
Sbjct: 1099 TESNCRYASRNDYSYWLSTGKALP 1122
Score = 57.6 bits (133), Expect = 5e-08
Identities = 30/93 (32%), Positives = 45/93 (48%), Gaps = 1/93 (1%)
Frame = +3
Query: 54 CATTDYLTGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDG-NEKAHNQDLGYAGSCVR 230
CA + + ++ V HSQ +P C V LW GYS + G +Q L GSC+
Sbjct: 1140 CAVCETTSNVIAV-HSQTTQIPDCPQDWVSLWSGYSFVMQTGIGADGSSQPLISPGSCLE 1198
Query: 231 KFSTMPFLFCDLNDVCNYASRNDRSYWLSTGQP 329
F +PF+ C CNY + S+W+++ P
Sbjct: 1199 TFRKVPFIECHGRGTCNYYP-DSYSFWMASLDP 1230
>UniRef50_Q9VMV5 Cluster: CG16858-PA; n=6; Schizophora|Rep: CG16858-PA
- Drosophila melanogaster (Fruit fly)
Length = 1940
Score = 99.5 bits (237), Expect = 1e-20
Identities = 42/87 (48%), Positives = 57/87 (65%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G + RHSQ VPQC LW+GYSL +A QDLG +GSC+ +F+TMP++
Sbjct: 1515 GFIFARHSQSVHVPQCPANTNLLWEGYSLSGNVAASRAVGQDLGQSGSCMMRFTTMPYML 1574
Query: 258 CDLNDVCNYASRNDRSYWLSTGQPIPM 338
CD+ +VC++A ND S WLST +P+PM
Sbjct: 1575 CDITNVCHFAQNNDDSLWLSTAEPMPM 1601
Score = 48.4 bits (110), Expect = 3e-05
Identities = 28/89 (31%), Positives = 42/89 (47%), Gaps = 1/89 (1%)
Frame = +3
Query: 54 CATTDYLTGILLVRHSQREVVPQCEPGHVKLWDGYS-LLYIDGNEKAHNQDLGYAGSCVR 230
C + T I+ + HSQ +P C G ++W GYS + N Q+L GSC+
Sbjct: 1618 CVVCETTTRIIAL-HSQSMSIPDCPGGWEEMWTGYSYFMSTLDNVGGVGQNLVSPGSCLE 1676
Query: 231 KFSTMPFLFCDLNDVCNYASRNDRSYWLS 317
+F P + C + CNY S+WL+
Sbjct: 1677 EFRAQPVIECHGHGRCNYYDAL-ASFWLT 1704
>UniRef50_O09238 Cluster: Collagen type IV; n=2; Pseudocorticium
jarrei|Rep: Collagen type IV - Pseudocorticium jarrei
Length = 854
Score = 88.2 bits (209), Expect = 3e-17
Identities = 43/90 (47%), Positives = 53/90 (58%), Gaps = 3/90 (3%)
Frame = +3
Query: 78 GILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
G+LLV HSQ +PQC + +LW GYSLL + GN QDLG GSC+ F MP +
Sbjct: 629 GLLLVVHSQTTNIPQCPNDYTRLWVGYSLLQLTGNGLGVGQDLGDPGSCMPSFHPMPVVR 688
Query: 258 CDLNDVCNYASRNDRSYWLSTG---QPIPM 338
C+ C +A R D SYWLST PIP+
Sbjct: 689 CNPMQRCEFARRKDESYWLSTNATRPPIPV 718
Score = 55.6 bits (128), Expect = 2e-07
Identities = 30/79 (37%), Positives = 40/79 (50%), Gaps = 1/79 (1%)
Frame = +3
Query: 81 ILLVRHSQREVVPQCEPGHVKLWDGYSLL-YIDGNEKAHNQDLGYAGSCVRKFSTMPFLF 257
I + HSQ VP C PG V LW G+S L + Q L GSC++ F + PF+
Sbjct: 738 ISIAVHSQDSNVPDCFPGWVTLWTGFSFLQQTAAQAEGTGQGLESPGSCLQHFRSTPFIG 797
Query: 258 CDLNDVCNYASRNDRSYWL 314
C C+Y S + SYW+
Sbjct: 798 CGGRGQCSYDSVSG-SYWM 815
>UniRef50_Q5BYE6 Cluster: SJCHGC08138 protein; n=1; Schistosoma
japonicum|Rep: SJCHGC08138 protein - Schistosoma
japonicum (Blood fluke)
Length = 206
Score = 74.5 bits (175), Expect = 4e-13
Identities = 37/84 (44%), Positives = 47/84 (55%), Gaps = 2/84 (2%)
Frame = +3
Query: 75 TGILLVRHSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPF- 251
+G L HSQ P C ++ GYSL+ + G++ + DLG GSC+RKFS MPF
Sbjct: 92 SGFLFTVHSQDSQPPSCPIYTTPVYTGYSLVTLQGDDDSTTMDLGTPGSCLRKFSIMPFA 151
Query: 252 -LFCDLNDVCNYASRNDRSYWLST 320
F +N C RN RSYWLST
Sbjct: 152 NCFAKVNGNCQINMRNGRSYWLST 175
>UniRef50_Q5C3P1 Cluster: SJCHGC06113 protein; n=2;
Platyhelminthes|Rep: SJCHGC06113 protein - Schistosoma
japonicum (Blood fluke)
Length = 587
Score = 65.7 bits (153), Expect = 2e-10
Identities = 38/99 (38%), Positives = 54/99 (54%), Gaps = 7/99 (7%)
Frame = +3
Query: 63 TDYLTGILLVRHSQREVVPQ--CEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKF 236
T+Y + IL RH Q V C G KL+ GYS + G + + DLG SC+ KF
Sbjct: 355 TNY-SSILFARHYQTPFVENLTCPGGTNKLFTGYSYVMGGGVDDLVSMDLGTPSSCLSKF 413
Query: 237 STMPFLFCDLNDVCNYASRNDRSYWLST-----GQPIPM 338
S++P C+ + C + R++RSYWL+T QPIP+
Sbjct: 414 SSLPMTQCERDTTCQSSMRHERSYWLATLVPRSEQPIPV 452
Score = 43.6 bits (98), Expect = 9e-04
Identities = 25/79 (31%), Positives = 36/79 (45%), Gaps = 1/79 (1%)
Frame = +3
Query: 96 HSQREVVPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCDLN-D 272
HSQ E + C +LW G SL+ Q L GSC+ F P + C+ N
Sbjct: 475 HSQGETLQPCPSTWTELWTGVSLILHTSGAHGGGQQLSSPGSCMEHFRYSPVIECNNNVG 534
Query: 273 VCNYASRNDRSYWLSTGQP 329
+C+Y S + + Y+L P
Sbjct: 535 MCHYWS-DAKVYYLRALNP 552
>UniRef50_Q4WVM5 Cluster: DNA polymerase gamma; n=13;
Pezizomycotina|Rep: DNA polymerase gamma - Aspergillus
fumigatus (Sartorya fumigata)
Length = 1135
Score = 33.1 bits (72), Expect = 1.2
Identities = 17/54 (31%), Positives = 24/54 (44%)
Frame = +3
Query: 147 WDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCDLNDVCNYASRNDRSY 308
WDGY L + D V+KF P + CD+++ N RNDR +
Sbjct: 533 WDGYPLTWSD----KFGWTFKVPKDQVKKFENQPVVLCDMSEEKNLELRNDRKH 582
>UniRef50_Q4T5R1 Cluster: Chromosome undetermined SCAF9151, whole
genome shotgun sequence; n=1; Tetraodon
nigroviridis|Rep: Chromosome undetermined SCAF9151,
whole genome shotgun sequence - Tetraodon nigroviridis
(Green puffer)
Length = 704
Score = 31.9 bits (69), Expect = 2.9
Identities = 14/36 (38%), Positives = 22/36 (61%)
Frame = +3
Query: 114 VPQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGS 221
VP+ PGHV+L G +L G+ + H ++LG A +
Sbjct: 113 VPERLPGHVRLVHGQQVLPGQGDVRLHREELGIAAA 148
>UniRef50_A1G1L8 Cluster: Glycoside hydrolase family 2,
immunoglobulin-like beta-sandwich; n=1; Stenotrophomonas
maltophilia R551-3|Rep: Glycoside hydrolase family 2,
immunoglobulin-like beta-sandwich - Stenotrophomonas
maltophilia R551-3
Length = 895
Score = 31.5 bits (68), Expect = 3.8
Identities = 21/63 (33%), Positives = 33/63 (52%)
Frame = +3
Query: 141 KLWDGYSLLYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCDLNDVCNYASRNDRSYWLST 320
++W GY L+ GN+ Q +G G V +S+ P DL++ N ++R D+ YW
Sbjct: 518 RVWQGYVDLF--GNDL--RQVVGEEGLGVPYWSSSPSN--DLDEKANDSTRGDKHYWQVW 571
Query: 321 GQP 329
G P
Sbjct: 572 GNP 574
>UniRef50_A3UAL4 Cluster: Dihydrolipoamide dehydrogenase; n=1;
Croceibacter atlanticus HTCC2559|Rep: Dihydrolipoamide
dehydrogenase - Croceibacter atlanticus HTCC2559
Length = 179
Score = 30.7 bits (66), Expect = 6.7
Identities = 15/40 (37%), Positives = 20/40 (50%)
Frame = +3
Query: 165 LYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCDLNDVCNY 284
L+IDGN N D GY + +F +P F DV +Y
Sbjct: 121 LFIDGNYDLSNLDFGYTDDQIFRFVVVPSDFALTVDVTDY 160
>UniRef50_A0GRE7 Cluster: Putative uncharacterized protein; n=1;
Burkholderia phytofirmans PsJN|Rep: Putative
uncharacterized protein - Burkholderia phytofirmans PsJN
Length = 734
Score = 30.7 bits (66), Expect = 6.7
Identities = 22/58 (37%), Positives = 29/58 (50%), Gaps = 4/58 (6%)
Frame = +3
Query: 150 DGYSLLYIDG--NEKAHNQDLGYAGSC--VRKFSTMPFLFCDLNDVCNYASRNDRSYW 311
+G+ L++D N + H D Y G VR F+TM DLN V ASRN+ W
Sbjct: 602 NGHDRLHLDAVSNAEGHVLDANYNGLTGHVRLFATM---LLDLNKVDVIASRNELQQW 656
>UniRef50_A2G6I6 Cluster: Putative uncharacterized protein; n=1;
Trichomonas vaginalis G3|Rep: Putative uncharacterized
protein - Trichomonas vaginalis G3
Length = 336
Score = 30.7 bits (66), Expect = 6.7
Identities = 15/34 (44%), Positives = 22/34 (64%)
Frame = -3
Query: 226 THEPAYPRSWLCAFSFPSIYNSE*PSHNFTCPGS 125
T++P + R ++CA S P IY S+ PS +F C S
Sbjct: 187 TYDP-FCRYFICASSRPKIYVSKHPSLDFVCEAS 219
>UniRef50_Q50244 Cluster: Surface layer protein B; n=4;
Methanosarcina|Rep: Surface layer protein B -
Methanosarcina mazei (Methanosarcina frisia)
Length = 652
Score = 30.7 bits (66), Expect = 6.7
Identities = 11/41 (26%), Positives = 21/41 (51%)
Frame = -3
Query: 244 IVLNFRTHEPAYPRSWLCAFSFPSIYNSE*PSHNFTCPGSH 122
+ + F+ + P +W +F + N + P H +T PGS+
Sbjct: 412 LTVTFKDNSSGSPTAWNWSFGDGAYSNEKYPKHTYTAPGSY 452
>UniRef50_Q2T420 Cluster: ImcF-related family; n=14;
Burkholderia|Rep: ImcF-related family - Burkholderia
thailandensis (strain E264 / ATCC 700388 / DSM 13276
/CIP 106301)
Length = 1164
Score = 30.3 bits (65), Expect = 8.8
Identities = 12/45 (26%), Positives = 21/45 (46%)
Frame = +2
Query: 23 KWPTWPTRCTMRHYRLLNWYIISATQPKGSCTSM*TRTCKIMGRL 157
+W W T H +L WY++ ++ G TS+ + + G L
Sbjct: 103 RWKRWVGTLTREHRAMLPWYLVLGSEGSGK-TSLVAKAVSVSGSL 146
>UniRef50_Q9CAI9 Cluster: Putative uncharacterized protein F28P22.5;
n=1; Arabidopsis thaliana|Rep: Putative uncharacterized
protein F28P22.5 - Arabidopsis thaliana (Mouse-ear
cress)
Length = 697
Score = 30.3 bits (65), Expect = 8.8
Identities = 20/64 (31%), Positives = 29/64 (45%), Gaps = 4/64 (6%)
Frame = +3
Query: 117 PQCEPGHVKLWDGYSLLYIDGNEKAHNQDLGYAGSCVRK-FSTMPFLFCDL---NDVCNY 284
PQC HV+L D + D ++AH L + G C RK + D+ N + NY
Sbjct: 43 PQCVLIHVQLGDTGGHFHQDNPDEAHEFFLPFRGFCARKGIIAKEVILHDIDISNAIVNY 102
Query: 285 ASRN 296
+ N
Sbjct: 103 ITNN 106
>UniRef50_P20061 Cluster: Transcobalamin-1 precursor; n=7;
Eutheria|Rep: Transcobalamin-1 precursor - Homo sapiens
(Human)
Length = 433
Score = 30.3 bits (65), Expect = 8.8
Identities = 18/58 (31%), Positives = 32/58 (55%), Gaps = 2/58 (3%)
Frame = +3
Query: 165 LYIDGNEKAHNQDLGYAGSCVRKFSTMPFLFCDLNDVCNYASRNDRSYW--LSTGQPI 332
+++ EKA + G + + S P++ C + +C A+ NDR+YW LS G+P+
Sbjct: 357 VFLSVMEKAQKMNDTIFGFTMEERSWGPYITC-IQGLC--ANNNDRTYWELLSGGEPL 411
Database: uniref50
Posted date: Oct 5, 2007 11:19 AM
Number of letters in database: 575,637,011
Number of sequences in database: 1,657,284
Lambda K H
0.318 0.134 0.401
Gapped
Lambda K H
0.279 0.0580 0.190
Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 337,567,428
Number of Sequences: 1657284
Number of extensions: 6332133
Number of successful extensions: 14215
Number of sequences better than 10.0: 43
Number of HSP's better than 10.0 without gapping: 13866
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 14184
length of database: 575,637,011
effective HSP length: 89
effective length of database: 428,138,735
effective search space used: 10275329640
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)