BLASTX 2.2.12 [Aug-07-2005]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= an--0367
         (785 letters)

Database: uniref50
           1,657,284 sequences; 575,637,011 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

UniRef50_UPI0000DB7127 Cluster: PREDICTED: similar to Collagen a...   104   2e-21
UniRef50_UPI0000E4A0B0 Cluster: PREDICTED: similar to procollage...    90   7e-17
UniRef50_P20849 Cluster: Collagen alpha-1(IX) chain precursor; n...    75   2e-12
UniRef50_Q4S052 Cluster: Chromosome 21 SCAF14785, whole genome s...    73   6e-12
UniRef50_Q7Q1W5 Cluster: ENSANGP00000020977; n=1; Anopheles gamb...    67   4e-10
UniRef50_UPI0000F20786 Cluster: PREDICTED: hypothetical protein,...    66   9e-10
UniRef50_Q07092 Cluster: Collagen alpha-1(XVI) chain precursor; ...    62   1e-08
UniRef50_UPI0000E801E7 Cluster: PREDICTED: similar to alpha 1 ty...    58   3e-07
UniRef50_Q4S053 Cluster: Chromosome 21 SCAF14785, whole genome s...    58   3e-07
UniRef50_Q14993 Cluster: Collagen alpha-1(XIX) chain precursor (...    54   4e-06
UniRef50_UPI0000F2075E Cluster: PREDICTED: hypothetical protein;...    54   5e-06
UniRef50_Q08BP6 Cluster: Zgc:153049; n=2; Danio rerio|Rep: Zgc:1...    52   2e-05
UniRef50_Q4RQA8 Cluster: Chromosome 17 SCAF15006, whole genome s...    48   2e-04
UniRef50_UPI0000DBF905 Cluster: UPI0000DBF905 related cluster; n...    48   3e-04
UniRef50_Q4RP14 Cluster: Chromosome 10 SCAF15009, whole genome s...    48   3e-04
UniRef50_UPI0000DBF903 Cluster: UPI0000DBF903 related cluster; n...    48   4e-04
UniRef50_Q8NFW1 Cluster: Collagen alpha-1(XXII) chain; n=23; Eut...    47   5e-04
UniRef50_UPI0000F20056 Cluster: PREDICTED: similar to gag-pol fu...    46   8e-04
UniRef50_Q66S51 Cluster: Collagen repeat-containing protein; n=1...    46   0.001
UniRef50_Q05707 Cluster: Collagen alpha-1(XIV) chain precursor; ...    46   0.001
UniRef50_UPI00006600F1 Cluster: Collagen alpha-1(XIV) chain prec...    46   0.001
UniRef50_Q99715 Cluster: Collagen alpha-1(XII) chain precursor; ...    44   0.003
UniRef50_UPI00006A0426 Cluster: Collagen alpha-1(XIV) chain prec...    44   0.004
UniRef50_Q4SH63 Cluster: Chromosome 8 SCAF14587, whole genome sh...    44   0.004
UniRef50_Q8C4S5 Cluster: 16 days embryo head cDNA, RIKEN full-le...    42   0.013
UniRef50_P12107 Cluster: Collagen alpha-1(XI) chain precursor; n...    42   0.013
UniRef50_UPI000065E422 Cluster: Collagen alpha-1(XI) chain precu...    41   0.031
UniRef50_Q4RWT3 Cluster: Chromosome 15 SCAF14981, whole genome s...    41   0.040
UniRef50_UPI0000F1F770 Cluster: PREDICTED: similar to collagen, ...    40   0.071
UniRef50_P25940 Cluster: Collagen alpha-3(V) chain precursor; n=...    40   0.071
UniRef50_UPI0000F2130C Cluster: PREDICTED: hypothetical protein;...    39   0.12
UniRef50_UPI00015A783F Cluster: LOC553362 protein; n=1; Danio re...    38   0.22
UniRef50_UPI000069F8A6 Cluster: Collagen alpha-1(XII) chain prec...    36   1.2
UniRef50_UPI000069F8A4 Cluster: Collagen alpha-1(XII) chain prec...    36   1.2
UniRef50_Q4SD21 Cluster: Chromosome 14 SCAF14645, whole genome s...    36   1.2
UniRef50_UPI0000F20887 Cluster: PREDICTED: similar to alpha 1 ty...    35   2.0
UniRef50_Q4SMJ3 Cluster: Chromosome 18 SCAF14547, whole genome s...    35   2.0
UniRef50_UPI0000E48BC9 Cluster: PREDICTED: similar to alpha 1 (V...    35   2.7
UniRef50_A7S1U8 Cluster: Predicted protein; n=1; Nematostella ve...    35   2.7
UniRef50_Q8WU66 Cluster: Protein TSPEAR precursor; n=29; Euteleo...    35   2.7
UniRef50_A3RYW5 Cluster: Co-activator of prophage gene expressio...    34   3.5
UniRef50_Q1LYI5 Cluster: Novel collagen protein; n=2; Danio reri...    34   4.6
UniRef50_Q1VR50 Cluster: Glycosyl hydrolase, family 16; n=1; Psy...    34   4.6
UniRef50_Q4SXE3 Cluster: Chromosome undetermined SCAF12445, whol...    33   6.1
UniRef50_Q2G901 Cluster: Transcriptional regulator, LysR family;...
33 6.1 UniRef50_Q4E0W9 Cluster: Putative uncharacterized protein; n=3; ... 33 6.1 UniRef50_Q17A79 Cluster: Collagen alpha chain, anopheles; n=7; C... 33 6.1 UniRef50_A0DPA5 Cluster: Chromosome undetermined scaffold_59, wh... 33 6.1 UniRef50_Q19350 Cluster: Drosophila crumbs homolog protein 1; n=... 33 8.1 >UniRef50_UPI0000DB7127 Cluster: PREDICTED: similar to Collagen alpha-1(IX) chain precursor; n=1; Apis mellifera|Rep: PREDICTED: similar to Collagen alpha-1(IX) chain precursor - Apis mellifera Length = 884 Score = 104 bits (250), Expect = 2e-21 Identities = 70/239 (29%), Positives = 119/239 (49%), Gaps = 13/239 (5%) Frame = +1 Query: 70 VLVFLSILFS-YSNGQADNFTTMAPFARSFCSPNSK-DSDFQTVDLIAVYRLDRSDT--T 237 V +FL + F+ Y G+ T+ F C + + D QT+D I+ +LD +T T Sbjct: 5 VWIFLFVQFTQYIFGEE----TIPDFVHGPCEGYKQGEDDLQTIDFISKLKLDLLETHYT 60 Query: 238 GVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVEGTFNARG-QRRPWSLLRA 414 GVT V+GS +Q AYR+ ++TLP K +FP GLP FSV TF AR + W +++ Sbjct: 61 GVTRVRGSNRMQTAYRLEKDTDITLPTKNIFPNGLPEEFSVVCTFRARKLSKYTWHIIKI 120 Query: 415 --RSNNVLFSLILMPEPRKVAVLVQGSR---AVFKSPELFTTGWHKLHVAVANRSVHIAV 579 N F + + P+ + + + V+ F + +F WHK+ + + I + Sbjct: 121 VDMENEPQFLIAMNPKGQTLDLFVKSDEMQIVSFSADHIFDKNWHKIDIGAFKDRLVIYI 180 Query: 580 DCVELNPVNISAY---DFSNATSLTIVSNDDGTPAPIDL*WLSLSCDRYVLKEDSCEEI 747 DC + ++ + S++ +S+ T PID+ W+ L+CD + ++CEE+ Sbjct: 181 DCEYVGTQDVKPWGPIKVDGEISISKMSHSKLT-VPIDIQWMVLNCDPTRPERETCEEL 238 >UniRef50_UPI0000E4A0B0 Cluster: PREDICTED: similar to procollagen, type IX, alpha 1, partial; n=2; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to procollagen, type IX, alpha 1, partial - Strongylocentrotus purpuratus Length = 993 Score = 89.8 bits (213), Expect = 7e-17 Identities = 58/206 (28%), Positives = 94/206 (45%), Gaps = 14/206 (6%) Frame = +1 Query: 172 KDSDFQTVDLIAVYRLDRSDTTGVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNR 351 +++D D+I ++ LD + GV +V GS D+Q AYR+ +++ P FP GLP+ Sbjct: 157 EENDLPGWDIIRMFELDSATIQGVRVVTGSDDVQRAYRLLKNHDISEPASNQFPQGLPDE 216 Query: 352 
FSVEGTF---NARGQRRPW--SLLRARSNNVLFSLILMPEPRKVAVLVQGSRAVFKSPE- 513 FS TF + G W L+R R+ + LM E + + + ++ Sbjct: 217 FSFVSTFKLTDRTGDDEDWWLWLIRDRAGTPQIGIRLMGEEKALQFIYVNELGQLENVRF 276 Query: 514 -----LFTTGWHKLHVAVANRSVHIAVDCVELNPVNISAYDFSNATSLTIVS---NDDGT 669 LF WHKLH+AV+ + + VDC+ + V + + T++ + G Sbjct: 277 DNAMMLFDRNWHKLHMAVSKNRLDLFVDCLSIGSVALRTRGQVDTLGETLIGRKFSAGGG 336 Query: 670 PAPIDL*WLSLSCDRYVLKEDSCEEI 747 P L W+ + CDR D C EI Sbjct: 337 PVQFILQWMIIHCDRTFPTRDHCSEI 362 >UniRef50_P20849 Cluster: Collagen alpha-1(IX) chain precursor; n=85; Euteleostomi|Rep: Collagen alpha-1(IX) chain precursor - Homo sapiens (Human) Length = 921 Score = 74.9 bits (176), Expect = 2e-12 Identities = 49/203 (24%), Positives = 95/203 (46%), Gaps = 14/203 (6%) Frame = +1 Query: 181 DFQTVDLIAVYRLDRSDTT-GVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFS 357 D DLI+ +++D++ + + V GS LQ+AY++G + +P + ++P GLP +S Sbjct: 53 DLPGFDLISQFQVDKAASRRAIQRVVGSATLQVAYKLGNNVDFRIPTRNLYPSGLPEEYS 112 Query: 358 VEGTFNARGQ--RRPWSL--LRARSNNVLFSLILMPEPRKVAVLVQG-----SRAVFKS- 507 TF G ++ W++ ++ S + + + + V +G A F + Sbjct: 113 FLTTFRMTGSTLKKNWNIWQIQDSSGKEQVGIKINGQTQSVVFSYKGLDGSLQTAAFSNL 172 Query: 508 PELFTTGWHKLHVAVANRSVHIAVDC--VELNPVN-ISAYDFSNATSLTIVSNDDGTPAP 678 LF + WHK+ + V S + VDC +E P+ D L ++++ P Sbjct: 173 SSLFDSQWHKIMIGVERSSATLFVDCNRIESLPIKPRGPIDIDGFAVLGKLADNPQVSVP 232 Query: 679 IDL*WLSLSCDRYVLKEDSCEEI 747 +L W+ + CD + ++C E+ Sbjct: 233 FELQWMLIHCDPLRPRRETCHEL 255 >UniRef50_Q4S052 Cluster: Chromosome 21 SCAF14785, whole genome shotgun sequence; n=2; Tetraodontidae|Rep: Chromosome 21 SCAF14785, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 825 Score = 73.3 bits (172), Expect = 6e-12 Identities = 58/213 (27%), Positives = 89/213 (41%), Gaps = 14/213 (6%) Frame = +1 Query: 151 SFCSP-NSKDSDFQTVDLIAVYRLDRSDTTGVTLVQGSQDLQMAYRIGGGANLTLPLKEV 327 S C P S D DLI ++LD GV V GS LQ+AYR+ AN +P Sbjct: 3 SVCPPLRSGQDDLPGFDLITQFQLDVIPLKGVRKVDGSTSLQVAYRLDREANFQIPTMLN 62 Query: 328 
FPGGLPNRFSVEGTFN--ARGQRRPWSL--LRARSNNVLFSLILMPEPRKVAVLV---QG 486 FP G P+ +S TF + W++ + N L L + + + + G Sbjct: 63 FPRGFPDEYSFMVTFRMIKNTVNKVWNVWQIMDEEGNKQAGLRLNGDQQALEYFLIDADG 122 Query: 487 SRAVFKSP---ELFTTGWHKLHVAVANRSVHIAVDC--VELNPV-NISAYDFSNATSLTI 648 + P LF T WHK+ + V V + VDC V++ P+ + T + Sbjct: 123 NLQTVTFPGLSVLFNTRWHKVMIGVERNQVTLYVDCQPVDMRPIRGKGPINTEGDTLIGR 182 Query: 649 VSNDDGTPAPIDL*WLSLSCDRYVLKEDSCEEI 747 + D +L W+ + CD + +SC E+ Sbjct: 183 LDTDPDASVVFELQWMLIHCDPKRAQRESCNEL 215 >UniRef50_Q7Q1W5 Cluster: ENSANGP00000020977; n=1; Anopheles gambiae str. PEST|Rep: ENSANGP00000020977 - Anopheles gambiae str. PEST Length = 438 Score = 67.3 bits (157), Expect = 4e-10 Identities = 61/223 (27%), Positives = 103/223 (46%), Gaps = 23/223 (10%) Frame = +1 Query: 175 DSDFQTVDLIAVYRLDRSDTTGVTL--VQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPN 348 D D ++ DLI +RLD+ + + + +QG+ + Q AYR ANLT+ + FP GLP+ Sbjct: 37 DVDLRSFDLIREFRLDQIEASSKHMHRLQGTNEYQTAYRFEKEANLTMRSVDAFPLGLPH 96 Query: 349 RFSVEGTFNARGQ-RRPWSLLRARSNNVLFSLILMPEP-RKVAVL----VQGSRAV--FK 504 +FS E T+ + W L + L + P R++ + +G + + + Sbjct: 97 QFSFECTYRIEDEGESSWHLFEVTNEVQESQLAITLNPGRQILQIGLPATEGEQQIVEYH 156 Query: 505 SPELFTTGWHKLHVAVANRSVHIAVDCVELNP------VNISAYD-FSNATSLTIVSNDD 663 LF WHK+ + V N +++ VDC + V + D F A +S Sbjct: 157 HTTLFDHNWHKIMLGVTNDYLNLWVDCRPVRDTDGNLNVPLEPRDRFDVADGYVSISRFA 216 Query: 664 GT-----PAP-IDL*WLSLSCDRYVLKEDSCEEIDNPNYLIAT 774 T +P IDL W+ ++CD +C+E+ P Y +A+ Sbjct: 217 ETSVFEPESPIIDLQWMVMNCDPTRPARGNCDEL--PVYDVAS 257 >UniRef50_UPI0000F20786 Cluster: PREDICTED: hypothetical protein, partial; n=2; Danio rerio|Rep: PREDICTED: hypothetical protein, partial - Danio rerio Length = 372 Score = 66.1 bits (154), Expect = 9e-10 Identities = 51/179 (28%), Positives = 78/179 (43%), Gaps = 10/179 (5%) Frame = +1 Query: 166 NSKDSDFQTVDLIAVYRLDRSDTTGVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLP 345 N +D D DLI ++LD GV V+GS LQ+AYR+ AN +P + FP G P Sbjct: 123 NGQD-DLPGFDLITQFQLDVIPMKGVRKVEGSTPLQVAYRLDREANFQIPTRLNFPRGFP 181 Query: 346 
NRFSVEGTFN--ARGQRRPWSLLRARSNNVLFSLILMPEPRKVAV-----LVQGSRAVFK 504 + +S TF + W+L + + L + + A+ ++G Sbjct: 182 DEYSFMTTFRMIKNTVNKVWNLWQVVDEDGLKQAGMRLNGDQQALEFFLTTLEGDVQTVT 241 Query: 505 SP---ELFTTGWHKLHVAVANRSVHIAVDCVELNPVNISAYDFSNATSLTIVSNDDGTP 672 P LF T WHK+ V V V + VDC +++ I + N T++ D P Sbjct: 242 FPGLSVLFNTKWHKVMVGVEKELVTLYVDCHQVDQKEIKRKGYVNTEGDTLIGRLDSDP 300 >UniRef50_Q07092 Cluster: Collagen alpha-1(XVI) chain precursor; n=38; cellular organisms|Rep: Collagen alpha-1(XVI) chain precursor - Homo sapiens (Human) Length = 1604 Score = 62.5 bits (145), Expect = 1e-08 Identities = 49/191 (25%), Positives = 82/191 (42%), Gaps = 11/191 (5%) Frame = +1 Query: 208 VYRLDRSDTTGVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVEGTF--NAR 381 ++RL T+ + ++ + + R+G A +T P + VFP GLP F++ T Sbjct: 54 IHRLSLMKTSAIKKIRNPKG-PLILRLGA-APVTQPTRRVFPRGLPEEFALVLTLLLKKH 111 Query: 382 GQRRPWSLLRARSNNVL--FSLILMPEPRKVAVLVQGS-----RAVFKSPELFTTGWHKL 540 ++ W L + N SL + + R + + QG +F P+LF WHKL Sbjct: 112 THQKTWYLFQVTDANGYPQISLEVNSQERSLELRAQGQDGDFVSCIFPVPQLFDLRWHKL 171 Query: 541 HVAVANRSVHIAVDCVELNPVNISAYDFSNATSLTIVSND--DGTPAPIDL*WLSLSCDR 714 ++VA R + VDC + + + D G P DL + + CD Sbjct: 172 MLSVAGRVASVHVDCSSASSQPLGPRRPMRPVGHVFLGLDAEQGKPVSFDLQQVHIYCDP 231 Query: 715 YVLKEDSCEEI 747 ++ E+ C EI Sbjct: 232 ELVLEEGCCEI 242 >UniRef50_UPI0000E801E7 Cluster: PREDICTED: similar to alpha 1 type XIX collagen; n=1; Gallus gallus|Rep: PREDICTED: similar to alpha 1 type XIX collagen - Gallus gallus Length = 890 Score = 57.6 bits (133), Expect = 3e-07 Identities = 48/168 (28%), Positives = 78/168 (46%), Gaps = 16/168 (9%) Frame = +1 Query: 292 GGANLTLPLKEVFPGGLPNRFSVEGTFNARGQRRP-----WSLLRARSNNVLFSLILMPE 456 G A L ++VFP GLP +S+ TF R + W +L R + S++L Sbjct: 81 GSAPLIRETRQVFPNGLPEEYSLVATFRIRRNTKKERWYIWQILNQRDIPEI-SVLLDGS 139 Query: 457 PRKVAVLVQGSRA-----VFKSPE---LFTTGWHKLHVAVANRSVHIAVDC--VELNPVN 606 + V + + ++ FKS E LF WHKL ++V + + + +DC +E + Sbjct: 140 KKVVEYMTKSAQGNILHYTFKSREIYPLFDRQWHKLGISVQSGIISLYLDCNLIERKQTD 199 Query: 
607 IS-AYDFSNATSLTIVSNDDGTPAPIDL*WLSLSCDRYVLKEDSCEEI 747 DF T + + DD P I+L +++ C+ V ED+C EI Sbjct: 200 EKFTIDFQGRTLIASRAADD-KPVDIELYRITVYCNPKVATEDTCCEI 246 >UniRef50_Q4S053 Cluster: Chromosome 21 SCAF14785, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 21 SCAF14785, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1380 Score = 57.6 bits (133), Expect = 3e-07 Identities = 47/177 (26%), Positives = 80/177 (45%), Gaps = 14/177 (7%) Frame = +1 Query: 265 DLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVEGTFNAR--GQRRPWSLLRARSNNVL-- 432 D Q +Y G + +++FP GLP+ ++ TF R +R W L + + Sbjct: 237 DGQSSYVRLGTVPIVQQTEDIFPQGLPDEYAFVTTFKFRKTSRREDWYLWQVFDKYGIPQ 296 Query: 433 FSLILMPEPRKVAVLVQGS-----RAVFKSPE---LFTTGWHKLHVAVANRSVHIAVDCV 588 S+ L E + V G RAVFK+PE LF WHK+ ++V +SV + +DC Sbjct: 297 VSIRLDGENKAVEYNAVGLTKDAVRAVFKNPEVDNLFDRSWHKIALSVEAKSVSLFLDCK 356 Query: 589 ELNPVNISAYDFSNATSLTIVSND--DGTPAPIDL*WLSLSCDRYVLKEDSCEEIDN 753 + + I + + T++ D P DL + + CD + ++C ++ N Sbjct: 357 HIQTLPIEEREDIDIQGKTVIGKRLYDSVPIDFDLQRMMIYCDSKHAELETCCDLPN 413 >UniRef50_Q14993 Cluster: Collagen alpha-1(XIX) chain precursor (Collagen alpha-1(Y) chain); n=22; Euteleostomi|Rep: Collagen alpha-1(XIX) chain precursor (Collagen alpha-1(Y) chain) - Homo sapiens (Human) Length = 1142 Score = 54.0 bits (124), Expect = 4e-06 Identities = 40/168 (23%), Positives = 77/168 (45%), Gaps = 14/168 (8%) Frame = +1 Query: 292 GGANLTLPLKEVFPGGLPNRFSVEGTFNAR--GQRRPWSLLRA--RSNNVLFSLILMPEP 459 G A L ++FP GLP +SV F R ++ W L + + N S+++ Sbjct: 80 GSALLIRDTIKIFPKGLPEEYSVAAMFRVRRNAKKERWFLWQVLNQQNIPQISIVVDGGK 139 Query: 460 RKVAVLVQGSRA-----VFKSPEL---FTTGWHKLHVAVANRSVHIAVDCVELNPVNISA 615 + V + Q + +F++ EL F WHKL +++ ++ + + +DC + Sbjct: 140 KVVEFMFQATEGDVLNYIFRNRELRPLFDRQWHKLGISIQSQVISLYMDCNLIARRQTDE 199 Query: 616 YDFSNATSLTIVSN--DDGTPAPIDL*WLSLSCDRYVLKEDSCEEIDN 753 D + T+++ DG P I+L L + C ++ +++C EI + Sbjct: 200 KDTVDFHGRTVIATRASDGKPVDIELHQLKIYCSANLIAQETCCEISD 247 >UniRef50_UPI0000F2075E 
Cluster: PREDICTED: hypothetical protein; n=1; Danio rerio|Rep: PREDICTED: hypothetical protein - Danio rerio Length = 753 Score = 53.6 bits (123), Expect = 5e-06 Identities = 43/166 (25%), Positives = 75/166 (45%), Gaps = 14/166 (8%) Frame = +1 Query: 292 GGANLTLPLKEVFPGGLPNRFSVEGTFNAR--GQRRPWSLLRARSNNVL--FSLILMPEP 459 G + ++V P GLP+ ++ TF R ++ W L + + S+ L E Sbjct: 111 GALPIVQKTEDVLPQGLPDEYAFVTTFKFRKTSRKEDWYLWQVYDKYGIPQVSIRLDGEN 170 Query: 460 RKVAVLVQGS-----RAVFKSPE---LFTTGWHKLHVAVANRSVHIAVDCVELNPVNISA 615 R V G RAVF++PE LF WHK+ + V ++SV + +DC + + I Sbjct: 171 RAVEYNAVGLTKDAVRAVFRNPEVDNLFDRNWHKIGMRVDSKSVSLFLDCKHIETLPIEE 230 Query: 616 YDFSNATSLTIVSND--DGTPAPIDL*WLSLSCDRYVLKEDSCEEI 747 + + T++ D P DL + + CD ++++C +I Sbjct: 231 REDIDIQGKTVIGKRLYDSVPIDFDLQRMVIYCDGKQSEQETCCDI 276 >UniRef50_Q08BP6 Cluster: Zgc:153049; n=2; Danio rerio|Rep: Zgc:153049 - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 290 Score = 51.6 bits (118), Expect = 2e-05 Identities = 47/164 (28%), Positives = 72/164 (43%), Gaps = 15/164 (9%) Frame = +1 Query: 292 GGANLTLPLKEVFPGGLPNRFSVEGTFNARGQRRP--WSLLRA--RSNNVLFSLILMPEP 459 G L P + VFP GL + +S+ TF R + W +L+ + SLI+ Sbjct: 79 GSKPLFKPTESVFPNGLSHEYSIVATFRIRKTTKKDRWFVLQIFDKGGTSQVSLIVDGAK 138 Query: 460 RKVAVLVQGSRA-----VFKSPEL---FTTGWHKLHVAVANRSVHIAVDCVELN---PVN 606 + V L G VFK+ +L F +HKL V+V + +V I +DC + Sbjct: 139 KSVEFLALGFLKNSLLYVFKNRDLHALFDRQFHKLGVSVESNAVSIYLDCELIERQVTAE 198 Query: 607 ISAYDFSNATSLTIVSNDDGTPAPIDL*WLSLSCDRYVLKEDSC 738 S D S T +T +DG P ++L + + CD + D C Sbjct: 199 RSGIDVSGRTFIT-TRLEDGKPVDVELQEILVFCDSRIADLDRC 241 >UniRef50_Q4RQA8 Cluster: Chromosome 17 SCAF15006, whole genome shotgun sequence; n=2; Clupeocephala|Rep: Chromosome 17 SCAF15006, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 250 Score = 48.4 bits (110), Expect = 2e-04 Identities = 44/185 (23%), Positives = 75/185 (40%), Gaps = 13/185 (7%) Frame = +1 Query: 238 GVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVEGTFNARGQR--RPWSLLR 411 GV V 
GS +AYR+ +L +V+P GLP+ FS+ TF + W L + Sbjct: 36 GVQRVDGSGPAAVAYRLNPSIHLRRSTSDVYPDGLPSDFSIIATFKVTEDTAGKSWDLWQ 95 Query: 412 AR--SNNVLFSLILMPEPRKVAVLVQGSRA-----VFKSPE-LFTTGWHKLHVAVANRSV 567 L + R + F E +F WHKL ++V V Sbjct: 96 VSDPEGKEQVGLRFHGDTRSLDFFYTSPHTRKMVRTFSGVERVFDGEWHKLALSVKADQV 155 Query: 568 HIAVDCVELNPVNISAYD---FSNATSLTIVSNDDGTPAPIDL*WLSLSCDRYVLKEDSC 738 + +DC E+ ++ TS+ ++ D + + +DL + +SCD + + C Sbjct: 156 KLLIDCQEVKVESVDQLRPVVLPGYTSIVKRASGDRSMS-VDLQQMEVSCDPEKVHSEGC 214 Query: 739 EEIDN 753 E+ + Sbjct: 215 CELSS 219 >UniRef50_UPI0000DBF905 Cluster: UPI0000DBF905 related cluster; n=1; Rattus norvegicus|Rep: UPI0000DBF905 UniRef100 entry - Rattus norvegicus Length = 1513 Score = 48.0 bits (109), Expect = 3e-04 Identities = 38/156 (24%), Positives = 72/156 (46%), Gaps = 15/156 (9%) Frame = +1 Query: 319 KEVFPGGLPNRFSVEGTFNAR--GQRRPWSLLRARSNNVL--FSLILMPEPRKVAVLVQG 486 ++VFP GLP+ ++ TF R ++ W + + + S+ L E + V G Sbjct: 1 RDVFPQGLPDEYAFVTTFRFRKTSRKEDWYIWQVIDQYGIPQVSIRLDGENKAVEYNAVG 60 Query: 487 S-----RAVFKSPE---LFTTGWHKLHVAVANRSVHIAVDCVELNPVNISAYDFSNATSL 642 + R VF+ P+ LF WHK+ +++ ++V + +DC+ + + I + + Sbjct: 61 AMEDAVRVVFRGPQVDNLFDRDWHKMALSIQAQNVSLYIDCMLVQTLPIEERENIDIQGK 120 Query: 643 TIVSND--DGTPAPIDL*WLSLSCD-RYVLKEDSCE 741 T++ D P DL + + CD R+ E C+ Sbjct: 121 TVIGKRLYDSVPIDFDLQRIVIYCDSRHAELETCCD 156 >UniRef50_Q4RP14 Cluster: Chromosome 10 SCAF15009, whole genome shotgun sequence; n=2; root|Rep: Chromosome 10 SCAF15009, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1450 Score = 48.0 bits (109), Expect = 3e-04 Identities = 35/131 (26%), Positives = 62/131 (47%), Gaps = 12/131 (9%) Frame = +1 Query: 238 GVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVEGTFN--ARGQRRPWSL-- 405 GV++ GS + +AYR+ + + P KE+ P GLP +++ F P+ + Sbjct: 911 GVSMEPGSFNSYIAYRVHKDSFINQPTKEIHPEGLPPSYTIVLLFRLLPDSPSEPFDIWQ 970 Query: 406 LRARSNNVLFSLILMPEPRKVAVLVQGSR-----AVFKSPE---LFTTGWHKLHVAVANR 561 + ++NN + L P + + + +R A F + +F +HKLH+ V+ Sbjct: 971 
ISDKNNNPEVGVSLNPSSKTITFYNKDTRGEIQKATFNQEQVKRIFHGSFHKLHITVSPD 1030 Query: 562 SVHIAVDCVEL 594 V I VDC E+ Sbjct: 1031 KVKINVDCQEV 1041 >UniRef50_UPI0000DBF903 Cluster: UPI0000DBF903 related cluster; n=2; Rattus norvegicus|Rep: UPI0000DBF903 UniRef100 entry - Rattus norvegicus Length = 1113 Score = 47.6 bits (108), Expect = 4e-04 Identities = 38/156 (24%), Positives = 72/156 (46%), Gaps = 15/156 (9%) Frame = +1 Query: 319 KEVFPGGLPNRFSVEGTFNAR--GQRRPWSLLRARSNNVL--FSLILMPEPRKVAVLVQG 486 ++VFP GLP+ ++ TF R ++ W + + + S+ L E + V G Sbjct: 11 EDVFPQGLPDEYAFVTTFRFRKTSRKEDWYIWQVIDQYGIPQVSIRLDGENKAVEYNAVG 70 Query: 487 S-----RAVFKSPE---LFTTGWHKLHVAVANRSVHIAVDCVELNPVNISAYDFSNATSL 642 + R VF+ P+ LF WHK+ +++ ++V + +DC+ + + I + + Sbjct: 71 AMEDAVRVVFRGPQVDNLFDRDWHKMALSIQAQNVSLYIDCMLVQTLPIEERENIDIQGK 130 Query: 643 TIVSND--DGTPAPIDL*WLSLSCD-RYVLKEDSCE 741 T++ D P DL + + CD R+ E C+ Sbjct: 131 TVIGKRLYDSVPIDFDLQRIVIYCDSRHAELETCCD 166 >UniRef50_Q8NFW1 Cluster: Collagen alpha-1(XXII) chain; n=23; Euteleostomi|Rep: Collagen alpha-1(XXII) chain - Homo sapiens (Human) Length = 1626 Score = 47.2 bits (107), Expect = 5e-04 Identities = 46/205 (22%), Positives = 86/205 (41%), Gaps = 17/205 (8%) Frame = +1 Query: 220 DRSDTTGVTLVQGSQD--LQMAYRIGGGANLTLPLKEVFPGGLPNRFSVEGTFNAR--GQ 387 D D V + G ++ Q +Y G + ++VFP GLP+ ++ TF R + Sbjct: 247 DLMDLFSVKEILGKRENGAQSSYVRMGSFPVVQSTEDVFPQGLPDEYAFVTTFRFRKTSR 306 Query: 388 RRPWSLLRARSNNVL--FSLILMPEPRKVAVLVQGS-----RAVFKSP---ELFTTGWHK 537 + W + + + S+ L E + V G+ R VF+ +LF WHK Sbjct: 307 KEDWYIWQVIDQYGIPQVSIRLDGENKAVEYNAVGAMKDAVRVVFRGSRVNDLFDRDWHK 366 Query: 538 LHVAVANRSVHIAVDCVELNPVNISAYDFSNATSLTIVSND--DGTPAPIDL*WLSLSCD 711 + +++ ++V + +DC + + I + + T++ D P DL + + CD Sbjct: 367 MALSIQAQNVSLHIDCALVQTLPIEERENIDIQGKTVIGKRLYDSVPIDFDLQRIVIYCD 426 Query: 712 -RYVLKEDSCEEIDNPNYLIATSAP 783 R+ E C+ P + + P Sbjct: 427 SRHAELETCCDIPSGPCQVTVVTEP 451 >UniRef50_UPI0000F20056 Cluster: PREDICTED: similar to gag-pol fusion polyprotein; n=5; Danio rerio|Rep: PREDICTED: 
similar to gag-pol fusion polyprotein - Danio rerio Length = 2607 Score = 46.4 bits (105), Expect = 8e-04 Identities = 34/156 (21%), Positives = 74/156 (47%), Gaps = 12/156 (7%) Frame = +1 Query: 184 FQTVDLIAVYRLDRSDTTGVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVE 363 F+ +++ + S GV++ G+ + YR+ A ++ P + + P GLP+ +++ Sbjct: 1988 FRMMEMFGLAEKHYSSVQGVSMEPGTFNSFPCYRLHKDALVSQPTRYLHPEGLPSDYTIS 2047 Query: 364 GTFNARGQ--RRPWSL--LRARSNNVLFSLILMPEPRKVAVLVQGSRA-----VFKSPEL 516 F + + P++L + ++N L +IL + + + F+ PE+ Sbjct: 2048 MLFRILPETPQEPFALWEILDKNNEPLVGIILDNGGKTLTFFNSDYKGEFQTVTFEGPEI 2107 Query: 517 ---FTTGWHKLHVAVANRSVHIAVDCVELNPVNISA 615 F +HKLHVA++ + + +DC + +I+A Sbjct: 2108 KKIFYGSFHKLHVAISKTAAKVVIDCKTVGEKSINA 2143 >UniRef50_Q66S51 Cluster: Collagen repeat-containing protein; n=1; Oikopleura dioica|Rep: Collagen repeat-containing protein - Oikopleura dioica (Tunicate) Length = 1041 Score = 46.0 bits (104), Expect = 0.001 Identities = 51/206 (24%), Positives = 87/206 (42%), Gaps = 25/206 (12%) Frame = +1 Query: 169 SKDSDFQTVDLIAVYRLDRSDTTGVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPN 348 ++ D D I+ + LDR GVT + GS D Q AYRI A+L V LP Sbjct: 102 TQGGDIPAFDFISQFGLDREQLLGVTEIAGSTDRQRAYRISNDASLVNAQTPV----LPK 157 Query: 349 RFSVEGTFNARGQRR--PWSLLRARSNNVLFSLILMPEPRKVAVLVQGS------RAVFK 504 FS+ W L + +S+ + + V+ +V S + K Sbjct: 158 EFSINCILRMPDYTTGLVWDLFKIYDGLAEYSVQINGRRQTVSFIVADSSGKVLINSELK 217 Query: 505 SPE-LFTTGWHKLHVAV----ANRSVHIAVDCVELNPVNIS-----AYDFSNATS----L 642 + E LF T WH+L + V ++SV + +D +++ + DF + S Sbjct: 218 NAEALFNTEWHQLSLQVYGDRGDKSVGLFIDSDKISTSELGLPLSLLDDFQDIRSGERRT 277 Query: 643 TIVSNDDGT---PAPIDL*WLSLSCD 711 ++ ++ GT P+D+ + ++ CD Sbjct: 278 SMATSGRGTHINSVPVDIQFFNIHCD 303 >UniRef50_Q05707 Cluster: Collagen alpha-1(XIV) chain precursor; n=33; Euteleostomi|Rep: Collagen alpha-1(XIV) chain precursor - Homo sapiens (Human) Length = 1796 Score = 46.0 bits (104), Expect = 0.001 Identities = 43/206 (20%), Positives = 88/206 (42%), Gaps = 18/206 (8%) Frame = +1 Query: 184 
FQTVDLIAVYRLDRSDTTGVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVE 363 F+ +++ + D S GV++ G+ ++ Y++ A ++ P + + P GLP+ +++ Sbjct: 1230 FKMMEMFGLVEKDFSSVEGVSMEPGTFNVFPCYQLHKDALVSQPTRYLHPEGLPSDYTIS 1289 Query: 364 GTFNARGQ--RRPWSL--LRARSNNVLFSLILMPEPRKVAVLVQGSRA-----VFKSPEL 516 F + P++L + ++++ L +IL + + F+ PE+ Sbjct: 1290 FLFRILPDTPQEPFALWEILNKNSDPLVGVILDNGGKTLTYFNYDQSGDFQTVTFEGPEI 1349 Query: 517 ---FTTGWHKLHVAVANRSVHIAVDCVEL--NPVNISAYDFSNAT----SLTIVSNDDGT 669 F +HKLH+ V+ V + +DC ++ +N SA S+ + G Sbjct: 1350 RKIFYGSFHKLHIVVSETLVKVVIDCKQVGEKAMNASANITSDGVEVLGKMVRSRGPGGN 1409 Query: 670 PAPIDL*WLSLSCDRYVLKEDSCEEI 747 AP L + C D C E+ Sbjct: 1410 SAPFQLQMFDIVCSTSWANTDKCCEL 1435 >UniRef50_UPI00006600F1 Cluster: Collagen alpha-1(XIV) chain precursor (Undulin).; n=2; Clupeocephala|Rep: Collagen alpha-1(XIV) chain precursor (Undulin). - Takifugu rubripes Length = 1776 Score = 45.6 bits (103), Expect = 0.001 Identities = 45/213 (21%), Positives = 88/213 (41%), Gaps = 19/213 (8%) Frame = +1 Query: 166 NSKDSDFQTVDLIAVYRLDRSDTTGVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLP 345 +S F+ ++L + S +GV++V G+ + + + A L P + + P GLP Sbjct: 1236 DSATPGFRMMELFGLEESRYSSVSGVSMVPGTFNSFPCFHLQADAFLAQPTRFIHPEGLP 1295 Query: 346 NRFSVEGTFNARGQ--RRPWSLLRARSNNV--LFSLILMPEPRKVAVLVQGSRA-----V 498 + +++ F P++L N L +I+ + + + Sbjct: 1296 SDYTITLLFRLLPDTPEEPFALWEVLDKNQEPLVGVIVDNGGKTLTFFNNDYKGEFQTVT 1355 Query: 499 FKSPE---LFTTGWHKLHVAVANRSVHIAVDCVELNPVNISAYDFSNATSLTIV------ 651 F+ PE LF +HKLH+A++ S + +DC + +I+A + + ++ Sbjct: 1356 FEGPEIQKLFYGSFHKLHIAISKTSAKVVIDCRMVAEKSINAAGNISRDGVVVLGRMVRS 1415 Query: 652 -SNDDGTPAPIDL*WLSLSCDRYVLKEDSCEEI 747 N D + AP L ++C D C E+ Sbjct: 1416 RGNKDNS-APFQLQIFDIACSTLWASRDKCCEL 1447 >UniRef50_Q99715 Cluster: Collagen alpha-1(XII) chain precursor; n=68; Euteleostomi|Rep: Collagen alpha-1(XII) chain precursor - Homo sapiens (Human) Length = 3063 Score = 44.4 bits (100), Expect = 0.003 Identities = 35/131 (26%), Positives = 61/131 (46%), Gaps = 12/131 (9%) Frame = +1 Query: 238 
GVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVEGTFNARGQ--RRPWSL-- 405 GV+L GS AYRI A + P ++ P GLP +++ F + P+++ Sbjct: 2539 GVSLESGSFPSYSAYRIQKNAFVNQPTADLHPNGLPPSYTIILLFRLLPETPSDPFAIWQ 2598 Query: 406 LRARSNNVLFSLILMPEPRKVAVLVQGSR-----AVFKSPE---LFTTGWHKLHVAVANR 561 + R +I P + ++ + +R F + E LF +HK+H+ V ++ Sbjct: 2599 ITDRDYKPQVGVIADPSSKTLSFFNKDTRGEVQTVTFDTEEVKTLFYGSFHKVHIVVTSK 2658 Query: 562 SVHIAVDCVEL 594 SV I +DC E+ Sbjct: 2659 SVKIYIDCYEI 2669 >UniRef50_UPI00006A0426 Cluster: Collagen alpha-1(XIV) chain precursor (Undulin).; n=2; Xenopus tropicalis|Rep: Collagen alpha-1(XIV) chain precursor (Undulin). - Xenopus tropicalis Length = 1105 Score = 44.0 bits (99), Expect = 0.004 Identities = 46/235 (19%), Positives = 96/235 (40%), Gaps = 19/235 (8%) Frame = +1 Query: 100 YSNGQADNFTTMAPFARSFCSPNSKD-SDFQTVDLIAVYRLDRSDTTGVTLVQGSQDLQM 276 Y NG+ ++ + R+ C + + F+ +++ + + S+ GV++ G+ + Sbjct: 649 YFNGEGNSVSASG---RTHCPTATYNLPGFKMMEMFGLVEKEYSNMEGVSMEPGTFNNFP 705 Query: 277 AYRIGGGANLTLPLKEVFPGGLPNRFSVEGTFNARGQ--RRPWSL--LRARSNNVLFSLI 444 YR+ A L+ P K + P GLP+ ++V F + P++L + + L +I Sbjct: 706 CYRLHKDALLSQPTKYIHPEGLPSDYTVSFIFRILPDTPKEPFALWEILNKDYEPLVGVI 765 Query: 445 LMPEPRKVAVL-----VQGSRAVFKSPE---LFTTGWHKLHVAVANRSVHIAVDCVELNP 600 + + + Q F+ PE +F +HK+H+ + + + +DC ++ Sbjct: 766 VDNGGKTLTYFNYDYKGQFQTITFEGPEVGKIFYGSFHKVHIVITKTTTRLVIDCKQVAE 825 Query: 601 VNISAYDFSNATSLTIVS------NDDGTPAPIDL*WLSLSCDRYVLKEDSCEEI 747 +I+A L ++ AP L + C D C E+ Sbjct: 826 KSINAAGNITTDGLEVLGRMVRSRGPKDNSAPFQLQMFDIVCTTSWAARDKCCEL 880 >UniRef50_Q4SH63 Cluster: Chromosome 8 SCAF14587, whole genome shotgun sequence; n=2; Eukaryota|Rep: Chromosome 8 SCAF14587, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1557 Score = 44.0 bits (99), Expect = 0.004 Identities = 46/207 (22%), Positives = 83/207 (40%), Gaps = 19/207 (9%) Frame = +1 Query: 184 FQTVDLIAVYRLDRSDTTGVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVE 363 F+ +D + + S GV+L GS + +YR+ A ++ P K + P LP+ +++ Sbjct: 1089 
FRMMDKFGLVEKEYSTIPGVSLEPGSFNSFPSYRLHRDALVSQPTKYLHPEPLPSDYTIS 1148 Query: 364 GTFNARGQ--RRP---WSLLRARSNNVLFSLILMPEPRKVAVLVQGSRAVFKS------- 507 + + P W +L R N L LIL + + + F++ Sbjct: 1149 IMLRLLPETPQEPFALWEILNKR-NEPLVGLILDNSGKTLTFFNHDFKGEFQTVTFEGNE 1207 Query: 508 -PELFTTGWHKLHVAVANRSVHIAVDCVELNPVNISAYDFSNATSLTIVS------NDDG 666 +LF +HKLHV ++ SV +DC + ++ A + I+ Sbjct: 1208 IKKLFHGSFHKLHVTISKTSVKAVLDCSAVGEKSVYAAGNITTDGVEILGRMVRSRGRRD 1267 Query: 667 TPAPIDL*WLSLSCDRYVLKEDSCEEI 747 + AP L + C + D C E+ Sbjct: 1268 SSAPFQLQMFDIICSTSWARRDKCCEL 1294 >UniRef50_Q8C4S5 Cluster: 16 days embryo head cDNA, RIKEN full-length enriched library, clone:C130002K24 product:procollagen, type XI, alpha 1, full insert sequence; n=7; Murinae|Rep: 16 days embryo head cDNA, RIKEN full-length enriched library, clone:C130002K24 product:procollagen, type XI, alpha 1, full insert sequence - Mus musculus (Mouse) Length = 275 Score = 42.3 bits (95), Expect = 0.013 Identities = 35/130 (26%), Positives = 64/130 (49%), Gaps = 10/130 (7%) Frame = +1 Query: 226 SDTTGV-TLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGL-PNRFSVEGTFNARGQRRPW 399 S TTG T + S+D +AYR+ A ++ P K++FPGG+ P FS+ F + ++ Sbjct: 54 SKTTGFCTNRKNSKDPDVAYRVTEEAQISAPTKQLFPGGIFPQDFSI--LFTIKPKKGTQ 111 Query: 400 SLLRARSNNVLFSLILMPEPRKVAVLVQG--SRAVFKSPELFTT------GWHKLHVAVA 555 + L + N + + R L + + ++ LF+T WH++ ++V Sbjct: 112 AFLLSLYNEHGIQQLGVEVGRSPVFLFEDHTGKPTPENYPLFSTVNIADGKWHRVAISVE 171 Query: 556 NRSVHIAVDC 585 ++V + VDC Sbjct: 172 KKTVTMIVDC 181 >UniRef50_P12107 Cluster: Collagen alpha-1(XI) chain precursor; n=83; Euteleostomi|Rep: Collagen alpha-1(XI) chain precursor - Homo sapiens (Human) Length = 1806 Score = 42.3 bits (95), Expect = 0.013 Identities = 37/130 (28%), Positives = 61/130 (46%), Gaps = 10/130 (7%) Frame = +1 Query: 226 SDTTGV-TLVQGSQDLQMAYRIGGGANLTLPLKEVFPGG-LPNRFSVEGTFNARGQRRPW 399 S TTG T + S+ AYR+ A L+ P K++FPGG P FS+ F + ++ Sbjct: 55 SKTTGFCTNRKNSKGSDTAYRVSKQAQLSAPTKQLFPGGTFPEDFSI--LFTVKPKKGIQ 112 Query: 400 
SLLRARSNNVLFSLILMPEPRKVAVLVQ---GSRA-----VFKSPELFTTGWHKLHVAVA 555 S L + N I + R L + G A +F++ + WH++ ++V Sbjct: 113 SFLLSIYNEHGIQQIGVEVGRSPVFLFEDHTGKPAPEDYPLFRTVNIADGKWHRVAISVE 172 Query: 556 NRSVHIAVDC 585 ++V + VDC Sbjct: 173 KKTVTMIVDC 182 >UniRef50_UPI000065E422 Cluster: Collagen alpha-1(XI) chain precursor.; n=1; Takifugu rubripes|Rep: Collagen alpha-1(XI) chain precursor. - Takifugu rubripes Length = 1668 Score = 41.1 bits (92), Expect = 0.031 Identities = 38/146 (26%), Positives = 66/146 (45%), Gaps = 11/146 (7%) Frame = +1 Query: 244 TLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGL-PNRFSVEGTFNARGQRRPWSLLRARS 420 TL +G++ +AYR+ A ++ P K++FP G+ P FS+ T + + S L + Sbjct: 27 TLRRGAKP-DIAYRVSKAAQISAPTKQLFPEGVFPEDFSIMFTIKPKAGLQ--SFLLSVY 83 Query: 421 NNVLFSLILMPEPRKVAVLVQGSRA--------VFKSPELFTTGWHKLHVAVANRSVHIA 576 NN + + R L + +F S L WH++ ++V +SV I Sbjct: 84 NNQGIQQLGVEVGRGPVFLYEDQHGKPSADEYPLFHSVNLADGKWHRVALSVDKKSVTII 143 Query: 577 VDCVE--LNPVNISAYDFSNATSLTI 648 VDC + P+ S+ N +T+ Sbjct: 144 VDCKKKVTRPLRRSSRAVINTEGITV 169 >UniRef50_Q4RWT3 Cluster: Chromosome 15 SCAF14981, whole genome shotgun sequence; n=2; Clupeocephala|Rep: Chromosome 15 SCAF14981, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1877 Score = 40.7 bits (91), Expect = 0.040 Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 9/123 (7%) Frame = +1 Query: 244 TLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGL-PNRFSVEGTFNARGQRRPWSLLRARS 420 TL +G + +AYR+ A ++ P K++FPGG+ P FS+ T + + S L + Sbjct: 41 TLRRGGKP-DIAYRVSKTAQISTPTKQLFPGGVFPQDFSIMFTIKPKAGLQ--SFLLSMY 97 Query: 421 NNVLFSLILMPEPRKVAVLVQGSRA--------VFKSPELFTTGWHKLHVAVANRSVHIA 576 N+ + + R L + +F S L WH++ ++V +SV I Sbjct: 98 NHQGIQQLGVEVGRGPVFLYEDQHGKPSADEYPLFHSVNLADGKWHRVALSVDKKSVTII 157 Query: 577 VDC 585 VDC Sbjct: 158 VDC 160 >UniRef50_UPI0000F1F770 Cluster: PREDICTED: similar to collagen, type XXI, alpha 1,; n=2; Danio rerio|Rep: PREDICTED: similar to collagen, type XXI, alpha 1, - Danio rerio Length = 429 Score = 39.9 bits (89), Expect 
= 0.071 Identities = 46/180 (25%), Positives = 74/180 (41%), Gaps = 17/180 (9%) Frame = +1 Query: 253 QGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVEGTFNARGQ--RRPWSL--LRARS 420 QGS AY++ +L+ + +FPGGLP + T +G W L ++ + Sbjct: 224 QGSFYGSKAYQVTSRDDLSESTRMLFPGGLPPSYVFVATLKYKGSVAIEEWDLWRIQTKD 283 Query: 421 NNVLFSLILMPEPRKVAVLVQGSR-------AVF-KSP--ELFTTGWHKLHVAVANRSVH 570 ++ L R V S+ VF KSP +LF WH+L + V V Sbjct: 284 EKPQMAVSLNGLDRTVMFTTTTSKTPSGTQTVVFTKSPAKKLFDEKWHQLRLLVTEEDVT 343 Query: 571 IAVDCVELNPVNISAYD--FSNATSLTIVSNDDGTPAPIDL*WLSLSCD-RYVLKEDSCE 741 + VD +E+ + + D F N + T P ++ L + CD +E +CE Sbjct: 344 LYVDDLEIETLALEPPDGIFINGKTQVGKYVSKETTVPFEVQKLRIYCDPEQNNRETACE 403 >UniRef50_P25940 Cluster: Collagen alpha-3(V) chain precursor; n=77; Euteleostomi|Rep: Collagen alpha-3(V) chain precursor - Homo sapiens (Human) Length = 1745 Score = 39.9 bits (89), Expect = 0.071 Identities = 34/130 (26%), Positives = 56/130 (43%), Gaps = 1/130 (0%) Frame = +1 Query: 277 AYRIGGGANLTLPLKEVFPGG-LPNRFSVEGTFNARGQRRPWSLLRARSNNVLFSLILMP 453 A+RIG + L +P E+FP G P FS+ T RGQ S+L + + + + Sbjct: 64 AFRIGQASTLGIPTWELFPEGHFPENFSLLITL--RGQPANQSVLLSIYDERGARQLGLA 121 Query: 454 EPRKVAVLVQGSRAVFKSPELFTTGWHKLHVAVANRSVHIAVDCVELNPVNISAYDFSNA 633 + +L R + + L WH++ V++ V + DC PV F + Sbjct: 122 LGPALGLLGDPFRPLPQQVNLTDGRWHRVAVSIDGEMVTLVADCEAQPPVLGHGPRFISI 181 Query: 634 TSLTIVSNDD 663 LT++ D Sbjct: 182 AGLTVLGTQD 191 >UniRef50_UPI0000F2130C Cluster: PREDICTED: hypothetical protein; n=1; Danio rerio|Rep: PREDICTED: hypothetical protein - Danio rerio Length = 921 Score = 39.1 bits (87), Expect = 0.12 Identities = 43/180 (23%), Positives = 78/180 (43%), Gaps = 16/180 (8%) Frame = +1 Query: 250 VQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVEGTFNARGQRRPWSL----LRAR 417 +QGS + AY + ++T +++FP GLP + T +G S ++++ Sbjct: 246 IQGSLVSEGAYLLDKTTDITENTRDIFPEGLPPSYVFVSTLRLKGSSSTESFDLWRVKSK 305 Query: 418 SNNVLFSLILMPEPRKVAVLV-----QGSRAVFKSP---ELFTTGWHKLHVAVANRSVHI 573 + ++ L + ++ Q VF +P +F WH+L V V R V Sbjct: 306 
DGQIQAAVTLSGLQKYISFTTTNTFNQEQTVVFDAPGIEAVFDGSWHQLKVLVKPRQVIC 365 Query: 574 AVDCVEL--NPVNISAYDFSNA-TSLTIVSNDDGTPAPIDL*WLSLSCD-RYVLKEDSCE 741 +D E+ P++ F N T ++ S D + P ++ L L CD + +E +CE Sbjct: 366 YLDDQEIQDEPLDPVVPIFINGKTQISKRSGSDAS-VPTEIQKLRLYCDPQQSERETACE 424 >UniRef50_UPI00015A783F Cluster: LOC553362 protein; n=1; Danio rerio|Rep: LOC553362 protein - Danio rerio Length = 1353 Score = 38.3 bits (85), Expect = 0.22 Identities = 32/130 (24%), Positives = 57/130 (43%), Gaps = 7/130 (5%) Frame = +1 Query: 253 QGSQDLQMAYRIGGGANLTLPLKEVFP-GGLPNRFSVEGTFNARGQRRPWSLLRARSNNV 429 +G+ + MAYRI L+ P K++FP P FS+ T A+ + + L V Sbjct: 27 KGTSETDMAYRIDKKIQLSAPTKQLFPDSAFPENFSLMTTVKAKRNSQFFLLSLYDEQGV 86 Query: 430 -LFSLILMPEPRKVAVLVQGSRA-----VFKSPELFTTGWHKLHVAVANRSVHIAVDCVE 591 L + P + +G A +F+ L WH++ +V +SV + +DC + Sbjct: 87 QQLGLEMGRSPVFLYEDHKGQPAPDLYPLFRKINLSDGKWHRIAYSVEGKSVTLYLDCKK 146 Query: 592 LNPVNISAYD 621 + + + D Sbjct: 147 VQTLELMRGD 156 >UniRef50_UPI000069F8A6 Cluster: Collagen alpha-1(XII) chain precursor.; n=2; Xenopus tropicalis|Rep: Collagen alpha-1(XII) chain precursor. - Xenopus tropicalis Length = 1304 Score = 35.9 bits (79), Expect = 1.2 Identities = 43/185 (23%), Positives = 73/185 (39%), Gaps = 15/185 (8%) Frame = +1 Query: 238 GVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVEGTFNARGQ--RRPWSL-- 405 GV++ GS + AY + A ++LP E+ P GLP +++ F + P+SL Sbjct: 848 GVSMQPGSFNSFPAYNLHKDAFISLPTSELHPEGLPPSYTIILLFRLLPETPNEPFSLWQ 907 Query: 406 LRARSNNVLFSLILMPEPRKVAVLVQGS-----RAVFKSPE---LFTTGWHKLHVAVANR 561 + + + L P + ++ +G+ F E LF +HK+ + V+ Sbjct: 908 ITDKGYKPQVGVNLDPSKKTLSFFNKGAAGDTQTVTFDGNEVKKLFYGSFHKVLIVVSPS 967 Query: 562 SVHIAVDCVELNPVNISAYDFSNATSLTIVS---NDDGTPAPIDL*WLSLSCDRYVLKED 732 V I +DC E+ I A ++ D AP L + C D Sbjct: 968 YVRIYIDCSEVAEKEIKEAGNITAEGYEVLGKTLKGDKKSAPFLLQMFDIVCTVTWTSRD 1027 Query: 733 SCEEI 747 C +I Sbjct: 1028 RCCDI 1032 >UniRef50_UPI000069F8A4 Cluster: Collagen alpha-1(XII) chain precursor.; n=1; Xenopus tropicalis|Rep: Collagen alpha-1(XII) chain precursor. 
- Xenopus tropicalis Length = 1779 Score = 35.9 bits (79), Expect = 1.2 Identities = 43/185 (23%), Positives = 73/185 (39%), Gaps = 15/185 (8%) Frame = +1 Query: 238 GVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVEGTFNARGQ--RRPWSL-- 405 GV++ GS + AY + A ++LP E+ P GLP +++ F + P+SL Sbjct: 1258 GVSMQPGSFNSFPAYNLHKDAFISLPTSELHPEGLPPSYTIILLFRLLPETPNEPFSLWQ 1317 Query: 406 LRARSNNVLFSLILMPEPRKVAVLVQGS-----RAVFKSPE---LFTTGWHKLHVAVANR 561 + + + L P + ++ +G+ F E LF +HK+ + V+ Sbjct: 1318 ITDKGYKPQVGVNLDPSKKTLSFFNKGAAGDTQTVTFDGNEVKKLFYGSFHKVLIVVSPS 1377 Query: 562 SVHIAVDCVELNPVNISAYDFSNATSLTIVS---NDDGTPAPIDL*WLSLSCDRYVLKED 732 V I +DC E+ I A ++ D AP L + C D Sbjct: 1378 YVRIYIDCSEVAEKEIKEAGNITAEGYEVLGKTLKGDKKSAPFLLQMFDIVCTVTWTSRD 1437 Query: 733 SCEEI 747 C +I Sbjct: 1438 RCCDI 1442 >UniRef50_Q4SD21 Cluster: Chromosome 14 SCAF14645, whole genome shotgun sequence; n=1; Tetraodon nigroviridis|Rep: Chromosome 14 SCAF14645, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 2255 Score = 35.9 bits (79), Expect = 1.2 Identities = 23/73 (31%), Positives = 36/73 (49%) Frame = +1 Query: 142 FARSFCSPNSKDSDFQTVDLIAVYRLDRSDTTGVTLVQGSQDLQMAYRIGGGANLTLPLK 321 F F SP F+ ++ + S T GV++ GS + +YRI A LT P Sbjct: 1475 FLNGFTSPG-----FRMLEAFNLTEKTYSYTKGVSMEPGSFNSFTSYRIHKNAFLTQPSA 1529 Query: 322 EVFPGGLPNRFSV 360 +V P GLP+ +++ Sbjct: 1530 DVHPDGLPHAYTI 1542 >UniRef50_UPI0000F20887 Cluster: PREDICTED: similar to alpha 1 type XVI collagen; n=1; Danio rerio|Rep: PREDICTED: similar to alpha 1 type XVI collagen - Danio rerio Length = 221 Score = 35.1 bits (77), Expect = 2.0 Identities = 19/75 (25%), Positives = 37/75 (49%), Gaps = 2/75 (2%) Frame = +1 Query: 529 WHKLHVAVANRSVHIAVDC--VELNPVNISAYDFSNATSLTIVSNDDGTPAPIDL*WLSL 702 WHK+ +++ S + VDC +E P+ + + +L + D P ID+ + + Sbjct: 89 WHKVALSLQRESASLHVDCSSIENKPLELRGQLPISGHTLLGMRATDAAPVEIDIQQVMV 148 Query: 703 SCDRYVLKEDSCEEI 747 CD + +++C EI Sbjct: 149 YCDPSLAIQEACCEI 163 >UniRef50_Q4SMJ3 Cluster: Chromosome 18 SCAF14547, whole genome 
shotgun sequence; n=3; Clupeocephala|Rep: Chromosome 18 SCAF14547, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 1557 Score = 35.1 bits (77), Expect = 2.0 Identities = 43/156 (27%), Positives = 70/156 (44%), Gaps = 17/156 (10%) Frame = +1 Query: 193 VDLIAVYRLDRSDTTGVTLVQG------SQDL-QMAYRIGGGANLTLPLKEVFP-GGLPN 348 VD++ V L D GV+L G Q+L +A+RI L+ P +++FP P Sbjct: 1 VDVLKVLELSE-DMEGVSLEAGMCTSREGQELTDLAFRIDKKIQLSAPTRQLFPHSSFPM 59 Query: 349 RFSVEGTFNARGQRRPWSLLRARSNNVLFSLILMPEPRKVAVLVQGSRAVFKSPELFTT- 525 FSV T A + + LL + L L E + V + + SPEL+ T Sbjct: 60 NFSVMTTVRAVKDSQVF-LLSLYDSQGTQQLGL--EIGRSPVFLYEDQEGQPSPELYPTF 116 Query: 526 --------GWHKLHVAVANRSVHIAVDCVELNPVNI 609 WH++ +V + V + +DCV L+ +++ Sbjct: 117 RKINLADGKWHRVAYSVEGQLVTLYLDCVRLDTLDL 152 >UniRef50_UPI0000E48BC9 Cluster: PREDICTED: similar to alpha 1 (V) collagen; n=4; Strongylocentrotus purpuratus|Rep: PREDICTED: similar to alpha 1 (V) collagen - Strongylocentrotus purpuratus Length = 1223 Score = 34.7 bits (76), Expect = 2.7 Identities = 46/206 (22%), Positives = 79/206 (38%), Gaps = 9/206 (4%) Frame = +1 Query: 193 VDLIAVYRLDRSDTTGVTLVQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVEGTF 372 VDL+ R + ++G + YR N T P V P+ FSV Sbjct: 16 VDLLHDLRNKAPKAGRIKEIEGENGC-LVYRFKPQGNSTFPTANVVGTSFPSSFSVLMNM 74 Query: 373 NARGQRRPWSLLRA--RSNNVLFSLILMPEPRKVAVLVQGSRAV-FKSPELFTTGWHKLH 543 Q ++ +NN+L L + +V V +Q + F + + WHK+ Sbjct: 75 QY-SQEDLGDIVTVVDANNNILLRLRMGYSIFEVQVRIQRLTSYRFVAEDFADNKWHKVA 133 Query: 544 VAVANRSVHIAVDCVELNPVNISA----YDFSNATSLTIVS--NDDGTPAPIDL*WLSLS 705 V + + + +DC E+ + A DF+ A + N++ P D+ + S Sbjct: 134 VGINKNQIKVYMDCAEVATLPTRAGNARIDFTTAGKFVVGGHYNENDVPFKGDVQNVIFS 193 Query: 706 CDRYVLKEDSCEEIDNPNYLIATSAP 783 D +K+ E + + TSAP Sbjct: 194 DDFKAVKDCEFEYPCYRDGQLVTSAP 219 >UniRef50_A7S1U8 Cluster: Predicted protein; n=1; Nematostella vectensis|Rep: Predicted protein - Nematostella vectensis Length = 693 Score = 34.7 bits (76), Expect = 2.7 Identities = 29/108 (26%), Positives = 48/108 (44%), Gaps 
= 4/108 (3%) Frame = +1 Query: 274 MAYRIGGGANL-TLPLKEVFPGGLPNRFSVEGTFNARGQRRPWSLLRARSNNVLFSLILM 450 +AY + G L ++P +FP G+P FS+ T +L + N SL L Sbjct: 73 VAYHLRGTELLQSVPTGTLFPYGIPESFSIMATVRTEIDNSG-NLFSVYNANGELSLSLS 131 Query: 451 PEPRKVAVLVQG---SRAVFKSPELFTTGWHKLHVAVANRSVHIAVDC 585 P ++ + SR F++ L WH++ ++VA V + DC Sbjct: 132 VNPVELQYRTRSGGVSRIQFQA-SLADGEWHRIAISVARDQVKLLSDC 178 >UniRef50_Q8WU66 Cluster: Protein TSPEAR precursor; n=29; Euteleostomi|Rep: Protein TSPEAR precursor - Homo sapiens (Human) Length = 669 Score = 34.7 bits (76), Expect = 2.7 Identities = 37/151 (24%), Positives = 65/151 (43%), Gaps = 15/151 (9%) Frame = +1 Query: 178 SDFQTVDLIAVYRLDRSDTTGVTLVQ--GSQDLQMAYRIGGGANLTLPLKEVFPGG--LP 345 +D + +D++A T+G+ +VQ G++ LQ++ + ++ P +F P Sbjct: 27 TDLRPLDILAEVVPSDGATSGIRIVQVHGARGLQLS--VAAPRTMSFPASRIFSQCDLFP 84 Query: 346 NRFSVEGTF---NARGQRRPWSL--LRARSNNVLFSLILMPEPRKVAVLVQGS------R 492 FS+ T N +R + L + S+ +L L L P L + + R Sbjct: 85 EEFSIVVTLRVPNLPPKRNEYLLTVVAEESDLLLLGLRLSPAQLHFLFLREDTAGAWQTR 144 Query: 493 AVFKSPELFTTGWHKLHVAVANRSVHIAVDC 585 F+SP L WH L +AV+ + DC Sbjct: 145 VSFRSPALVDGRWHTLVLAVSAGVFSLTTDC 175 >UniRef50_A3RYW5 Cluster: Co-activator of prophage gene expression IbrA; n=7; Proteobacteria|Rep: Co-activator of prophage gene expression IbrA - Ralstonia solanacearum UW551 Length = 416 Score = 34.3 bits (75), Expect = 3.5 Identities = 15/29 (51%), Positives = 18/29 (62%) Frame = +1 Query: 85 SILFSYSNGQADNFTTMAPFARSFCSPNS 171 S+ F ++ AD TTM P A S CSPNS Sbjct: 386 SLSFGFARQDADGLTTMEPPAESTCSPNS 414 >UniRef50_Q1LYI5 Cluster: Novel collagen protein; n=2; Danio rerio|Rep: Novel collagen protein - Danio rerio (Zebrafish) (Brachydanio rerio) Length = 873 Score = 33.9 bits (74), Expect = 4.6 Identities = 45/179 (25%), Positives = 70/179 (39%), Gaps = 15/179 (8%) Frame = +1 Query: 250 VQGSQDLQMAYRIGGGANLTLPLKEVFPGGLPNRFSVEGTFNARGQRR--PWSLLRARSN 423 VQGS + A ++ ++T +E+FP GLP F T + + LLR S Sbjct: 227 VQGSLISERASQLSPRMDITHKTREIFPEGLPPAFVFVATLRLKNPAHWMKFDLLRVLSQ 286 
Query: 424 NVLFSLILMPEPRKVAV-------LVQGSRAVFKS---PELFTTGWHKLHVAVANRSVHI 573 + + + + +V L +F LF T WH+L V R + Sbjct: 287 DGVKQIAVTVNGADKSVIFTCTSTLKTEQTVIFNDRGIKRLFDTDWHQLKFLVKPRRITC 346 Query: 574 AVD--CVELNPVNISAYDFSNATSLTIVSNDDGTPAPIDL*WLSLSCD-RYVLKEDSCE 741 VD VE ++ + N + + T PI L L L CD + +E +CE Sbjct: 347 FVDGAYVEEQLLDPVVPIYINGKTQVAKKVNIETTVPILLQKLRLYCDPQQSERETACE 405 >UniRef50_Q1VR50 Cluster: Glycosyl hydrolase, family 16; n=1; Psychroflexus torquis ATCC 700755|Rep: Glycosyl hydrolase, family 16 - Psychroflexus torquis ATCC 700755 Length = 253 Score = 33.9 bits (74), Expect = 4.6 Identities = 21/70 (30%), Positives = 36/70 (51%), Gaps = 4/70 (5%) Frame = +1 Query: 484 GSRAVFKSPELF--TTGWHKLHVAVANRSVHIAV--DCVELNPVNISAYDFSNATSLTIV 651 G+ ++ +PE +G + + + V+N + IA D +E+ + I DF N + IV Sbjct: 68 GTSSIVINPEKTYDASGTYDVKLTVSNENSEIAAFQDIIEIEIILIINGDFENGSEGWIV 127 Query: 652 SNDDGTPAPI 681 DD PAP+ Sbjct: 128 GVDDNVPAPV 137 >UniRef50_Q4SXE3 Cluster: Chromosome undetermined SCAF12445, whole genome shotgun sequence; n=2; Tetraodon nigroviridis|Rep: Chromosome undetermined SCAF12445, whole genome shotgun sequence - Tetraodon nigroviridis (Green puffer) Length = 2225 Score = 33.5 bits (73), Expect = 6.1 Identities = 13/25 (52%), Positives = 18/25 (72%) Frame = +1 Query: 511 ELFTTGWHKLHVAVANRSVHIAVDC 585 +LF +HKLHVAV+ S +A+DC Sbjct: 1612 KLFYGSFHKLHVAVSQTSAKVAIDC 1636 >UniRef50_Q2G901 Cluster: Transcriptional regulator, LysR family; n=1; Novosphingobium aromaticivorans DSM 12444|Rep: Transcriptional regulator, LysR family - Novosphingobium aromaticivorans (strain DSM 12444) Length = 302 Score = 33.5 bits (73), Expect = 6.1 Identities = 22/81 (27%), Positives = 32/81 (39%) Frame = +1 Query: 442 ILMPEPRKVAVLVQGSRAVFKSPELFTTGWHKLHVAVANRSVHIAVDCVELNPVNISAYD 621 I+ P VA L G + +PE F WH V V + + ++A D Sbjct: 130 IIQPNEHSVAELEAGRADLLVTPEPFLASWHPSEVLFDEEQVVVGWAQNPIFARGVTADD 189 Query: 622 FSNATSLTIVSNDDGTPAPID 684 F A +T+ + TPA D Sbjct: 190 FYAAGHVTVAFGANRTPAFAD 210 >UniRef50_Q4E0W9 
Cluster: Putative uncharacterized protein; n=3; Trypanosoma cruzi|Rep: Putative uncharacterized protein - Trypanosoma cruzi Length = 699 Score = 33.5 bits (73), Expect = 6.1 Identities = 22/80 (27%), Positives = 35/80 (43%), Gaps = 1/80 (1%) Frame = -1 Query: 608 MFTGFNSTQSTAI*TDRFATATCSLCHPVVNNSGLLNTALEPWTRTATFLGSGIKMRENS 429 + TG NST + + T R SLCH +++ GLL P L ++ + Sbjct: 226 LVTGLNSTATEELLTSRLFGGDVSLCHDSIDSFGLLTVFRAP---IQELLARHEELAPSL 282 Query: 428 TLFERARRRLHGLR-CPRAL 372 T+ R+ LH CP+ + Sbjct: 283 TILRRSFAALHPSNGCPKTI 302 >UniRef50_Q17A79 Cluster: Collagen alpha chain, anopheles; n=7; Coelomata|Rep: Collagen alpha chain, anopheles - Aedes aegypti (Yellowfever mosquito) Length = 1746 Score = 33.5 bits (73), Expect = 6.1 Identities = 30/112 (26%), Positives = 48/112 (42%), Gaps = 7/112 (6%) Frame = +1 Query: 277 AYRIGGGANLTLPLKEVFPGGLPNRFSVEGTFNARGQRRPWSLLRARSNNVLFSLILMPE 456 AY + L++ +VFP G P+ FS+ A L S++ L+LM Sbjct: 16 AYNLNQDTVLSIGTTQVFPNGFPSDFSILVVLKATPNLVRVPLFTVYSSDSEEVLMLM-V 74 Query: 457 PRKVAVLVQGSRAVFKSPELFTTG-------WHKLHVAVANRSVHIAVDCVE 591 +VA+ Q + + L + G WH+L ++V SV + DC E Sbjct: 75 GMEVALYYQDTDGNPEEESLISFGVSIDDERWHRLGISVKGDSVTLIKDCHE 126 >UniRef50_A0DPA5 Cluster: Chromosome undetermined scaffold_59, whole genome shotgun sequence; n=3; Oligohymenophorea|Rep: Chromosome undetermined scaffold_59, whole genome shotgun sequence - Paramecium tetraurelia Length = 167 Score = 33.5 bits (73), Expect = 6.1 Identities = 19/57 (33%), Positives = 27/57 (47%), Gaps = 1/57 (1%) Frame = +1 Query: 79 FLSILFSYSNGQADNFTTMAPFARSFCSPNSKDSDFQTVDLIAVY-RLDRSDTTGVT 246 FL +F + D F F F N KDS D++A+Y R+ +SD GV+ Sbjct: 104 FLQKIFKRHDSDKDGFLNQEEFV--FLMKNYKDSHLTEADMLAIYNRMSQSDPKGVS 158 >UniRef50_Q19350 Cluster: Drosophila crumbs homolog protein 1; n=2; Caenorhabditis|Rep: Drosophila crumbs homolog protein 1 - Caenorhabditis elegans Length = 1722 Score = 33.1 bits (72), Expect = 8.1 Identities = 16/40 (40%), Positives = 18/40 (45%) Frame = +2 Query: 356 
VWREPSMLADNEGRGVFCEHVQITCCFLSF*CRNQGKWRS 475 VW DN RG C+H TC L F C N G R+ Sbjct: 1385 VWNNTVCNCDNNWRGAHCQHQMDTC--LDFPCNNDGVCRT 1422

  Database: uniref50
    Posted date: Oct 5, 2007 11:19 AM
  Number of letters in database: 575,637,011
  Number of sequences in database: 1,657,284

Lambda     K      H
   0.318    0.134    0.401

Gapped
Lambda     K      H
   0.279   0.0580    0.190

Matrix: BLOSUM62
Gap Penalties: Existence: 9, Extension: 2
Number of Hits to DB: 782,546,770
Number of Sequences: 1657284
Number of extensions: 16167194
Number of successful extensions: 41455
Number of sequences better than 10.0: 49
Number of HSP's better than 10.0 without gapping: 39829
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 41411
length of database: 575,637,011
effective HSP length: 99
effective length of database: 411,565,895
effective search space used: 66673674990
frameshift window, decay const: 40, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 37 (14.9 bits)
X3: 62 (25.0 bits)
S1: 41 (21.7 bits)
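
The statistics above can be cross-checked against the scores reported for the individual hits. The following is a minimal sketch, not part of the BLAST output itself: it assumes the standard Karlin-Altschul conversion from raw score to bit score, bits = (Lambda * S - ln K) / ln 2, and assumes that the effective query length for this translated search is the query length in nucleotides divided by 3, minus the effective HSP length. The function names are hypothetical and chosen only for illustration.

```python
import math

# Lambda and K as printed in the report footer (assumed to be the
# ungapped and gapped Karlin-Altschul parameters, respectively).
UNGAPPED = (0.318, 0.134)
GAPPED = (0.279, 0.0580)


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_x, lam):
    """Convert a raw X-dropoff value to bits (no K term is involved)."""
    return lam * raw_x / math.log(2)


def effective_search_space(query_nt, db_letters, db_seqs, hsp_len):
    """Effective search space under the assumptions stated in the lead-in."""
    eff_query = query_nt // 3 - hsp_len      # translated query length, corrected
    eff_db = db_letters - db_seqs * hsp_len  # database length, corrected
    return eff_query * eff_db


if __name__ == "__main__":
    # Raw scores 85, 79 and 72 are the values in parentheses on the Score lines.
    for raw in (85, 79, 72):
        print(raw, round(bit_score(raw, *GAPPED), 1))
    print("S1", round(bit_score(41, *UNGAPPED), 1))
    print("X1", round(dropoff_bits(16, UNGAPPED[0]), 1))
    print("X2", round(dropoff_bits(37, GAPPED[0]), 1))
    print(effective_search_space(785, 575_637_011, 1_657_284, 99))
```

Under these assumptions the gapped parameters give back the bit scores shown for the hits (for example, raw score 85 corresponds to 38.3 bits), and the search-space calculation matches the "effective search space used" line. The Expect values are not recomputed here, because the exact correction BLASTX applies is not stated in the report.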