Please note: this site relies heavily on the use of javascript. Without a javascript-enabled browser, this site will not function correctly. Please enable javascript and reload the page, or switch to a different browser.
0  structures 1  species 0  interactions 1  sequence 1  architecture

Protein: CSPG4_HUMAN (Q6UVK1)

Summary

This is the summary of UniProt entry CSPG4_HUMAN (Q6UVK1).

Description: Chondroitin sulfate proteoglycan 4
Source organism: Homo sapiens (Human) (NCBI taxonomy ID 9606)
View Pfam proteome data.
Length: 2322 amino acids

Please note: when we start each new Pfam data release, we take a copy of the UniProt sequence database. This snapshot of UniProt forms the basis of the overview that you see here. It is important to note that, although some UniProt entries may be removed after a Pfam release, these entries will not be removed from Pfam until the next Pfam data release.

Pfam domains

Source Domain Start End
disorder n/a 1 6
sig_p n/a 1 29
low_complexity n/a 3 30
Pfam A Laminin_G_2 55 178
low_complexity n/a 62 76
low_complexity n/a 149 156
disorder n/a 226 227
Pfam A Laminin_G_2 230 363
disorder n/a 274 277
disorder n/a 283 284
disorder n/a 334 340
low_complexity n/a 410 424
disorder n/a 411 413
disorder n/a 577 578
disorder n/a 600 616
disorder n/a 640 642
low_complexity n/a 716 727
disorder n/a 738 744
disorder n/a 796 814
low_complexity n/a 800 814
disorder n/a 818 826
disorder n/a 834 835
disorder n/a 837 840
Pfam B Pfam-B_23253 873 935
low_complexity n/a 905 917
disorder n/a 944 945
disorder n/a 947 948
disorder n/a 971 972
disorder n/a 981 988
low_complexity n/a 1040 1054
disorder n/a 1210 1214
disorder n/a 1261 1262
disorder n/a 1300 1301
disorder n/a 1303 1307
low_complexity n/a 1339 1357
disorder n/a 1401 1411
disorder n/a 1444 1446
disorder n/a 1462 1463
disorder n/a 1477 1483
disorder n/a 1487 1501
disorder n/a 1503 1511
disorder n/a 1521 1522
disorder n/a 1524 1527
disorder n/a 1557 1559
disorder n/a 1603 1606
Pfam B Pfam-B_53 1643 1728
disorder n/a 1669 1674
disorder n/a 1735 1736
disorder n/a 1746 1749
disorder n/a 1760 1763
disorder n/a 1765 1767
disorder n/a 1772 1773
disorder n/a 1796 1811
disorder n/a 1815 1816
disorder n/a 1821 1845
Pfam B Pfam-B_26752 1834 1956
disorder n/a 1852 1858
disorder n/a 1865 1871
low_complexity n/a 1958 1970
low_complexity n/a 2088 2101
disorder n/a 2094 2115
disorder n/a 2120 2143
low_complexity n/a 2128 2139
disorder n/a 2182 2206
transmembrane n/a 2225 2246
low_complexity n/a 2231 2246
disorder n/a 2255 2266
disorder n/a 2268 2276
disorder n/a 2279 2308
low_complexity n/a 2290 2300

Show or hide domain scores.

Sequence annotations

This section shows a graphical representation of this sequence, with Pfam domains shown in the standard Pfam format. Under the Pfam domain image we show various tracks, illustrating features on this sequence that we found in other databases. You can choose which databases to include using the drop-down panel under the image. More...

Note: it can take a few seconds for this image to be generated and loaded.

Loading feature alignment...

Show sources update panel.

Sequence information

This is the amino acid sequence of the UniProt sequence database entry with the accession Q6UVK1. This sequence is stored in the Pfam database and updated with each new Pfam release, but this means that the sequence we store may differ from that stored by UniProt.

Sequence:
1
MQSGPRPPLP APGLALALTL TMLARLASAA SFFGENHLEV PVATALTDID
50
51
LQLQFSTSQP EALLLLAAGP ADHLLLQLYS GRLQVRLVLG QEELRLQTPA
100
101
ETLLSDSIPH TVVLTVVEGW ATLSVDGFLN ASSAVPGAPL EVPYGLFVGG
150
151
TGTLGLPYLR GTSRPLRGCL HAATLNGRSL LRPLTPDVHE GCAEEFSASD
200
201
DVALGFSGPH SLAAFPAWGT QDEGTLEFTL TTQSRQAPLA FQAGGRRGDF
250
251
IYVDIFEGHL RAVVEKGQGT VLLHNSVPVA DGQPHEVSVH INAHRLEISV
300
301
DQYPTHTSNR GVLSYLEPRG SLLLGGLDAE ASRHLQEHRL GLTPEATNAS
350
351
LLGCMEDLSV NGQRRGLREA LLTRNMAAGC RLEEEEYEDD AYGHYEAFST
400
401
LAPEAWPAME LPEPCVPEPG LPPVFANFTQ LLTISPLVVA EGGTAWLEWR
450
451
HVQPTLDLME AELRKSQVLF SVTRGARHGE LELDIPGAQA RKMFTLLDVV
500
501
NRKARFIHDG SEDTSDQLVL EVSVTARVPM PSCLRRGQTY LLPIQVNPVN
550
551
DPPHIIFPHG SLMVILEHTQ KPLGPEVFQA YDPDSACEGL TFQVLGTSSG
600
601
LPVERRDQPG EPATEFSCRE LEAGSLVYVH RGGPAQDLTF RVSDGLQASP
650
651
PATLKVVAIR PAIQIHRSTG LRLAQGSAMP ILPANLSVET NAVGQDVSVL
700
701
FRVTGALQFG ELQKQGAGGV EGAEWWATQA FHQRDVEQGR VRYLSTDPQH
750
751
HAYDTVENLA LEVQVGQEIL SNLSFPVTIQ RATVWMLRLE PLHTQNTQQE
800
801
TLTTAHLEAT LEEAGPSPPT FHYEVVQAPR KGNLQLQGTR LSDGQGFTQD
850
851
DIQAGRVTYG ATARASEAVE DTFRFRVTAP PYFSPLYTFP IHIGGDPDAP
900
901
VLTNVLLVVP EGGEGVLSAD HLFVKSLNSA SYLYEVMERP RHGRLAWRGT
950
951
QDKTTMVTSF TNEDLLRGRL VYQHDDSETT EDDIPFVATR QGESSGDMAW
1000
1001
EEVRGVFRVA IQPVNDHAPV QTISRIFHVA RGGRRLLTTD DVAFSDADSG
1050
1051
FADAQLVLTR KDLLFGSIVA VDEPTRPIYR FTQEDLRKRR VLFVHSGADR
1100
1101
GWIQLQVSDG QHQATALLEV QASEPYLRVA NGSSLVVPQG GQGTIDTAVL
1150
1151
HLDTNLDIRS GDEVHYHVTA GPRWGQLVRA GQPATAFSQQ DLLDGAVLYS
1200
1201
HNGSLSPRDT MAFSVEAGPV HTDATLQVTI ALEGPLAPLK LVRHKKIYVF
1250
1251
QGEAAEIRRD QLEAAQEAVP PADIVFSVKS PPSAGYLVMV SRGALADEPP
1300
1301
SLDPVQSFSQ EAVDTGRVLY LHSRPEAWSD AFSLDVASGL GAPLEGVLVE
1350
1351
LEVLPAAIPL EAQNFSVPEG GSLTLAPPLL RVSGPYFPTL LGLSLQVLEP
1400
1401
PQHGALQKED GPQARTLSAF SWRMVEEQLI RYVHDGSETL TDSFVLMANA
1450
1451
SEMDRQSHPV AFTVTVLPVN DQPPILTTNT GLQMWEGATA PIPAEALRST
1500
1501
DGDSGSEDLV YTIEQPSNGR VVLRGAPGTE VRSFTQAQLD GGLVLFSHRG
1550
1551
TLDGGFRFRL SDGEHTSPGH FFRVTAQKQV LLSLKGSQTL TVCPGSVQPL
1600
1601
SSQTLRASSS AGTDPQLLLY RVVRGPQLGR LFHAQQDSTG EALVNFTQAE
1650
1651
VYAGNILYEH EMPPEPFWEA HDTLELQLSS PPARDVAATL AVAVSFEAAC
1700
1701
PQRPSHLWKN KGLWVPEGQR ARITVAALDA SNLLASVPSP QRSEHDVLFQ
1750
1751
VTQFPSRGQL LVSEEPLHAG QPHFLQSQLA AGQLVYAHGG GGTQQDGFHF
1800
1801
RAHLQGPAGA SVAGPQTSEA FAITVRDVNE RPPQPQASVP LRLTRGSRAP
1850
1851
ISRAQLSVVD PDSAPGEIEY EVQRAPHNGF LSLVGGGLGP VTRFTQADVD
1900
1901
SGRLAFVANG SSVAGIFQLS MSDGASPPLP MSLAVDILPS AIEVQLRAPL
1950
1951
EVPQALGRSS LSQQQLRVVS DREEPEAAYR LIQGPQYGHL LVGGRPTSAF
2000
2001
SQFQIDQGEV VFAFTNFSSS HDHFRVLALA RGVNASAVVN VTVRALLHVW
2050
2051
AGGPWPQGAT LRLDPTVLDA GELANRTGSV PRFRLLEGPR HGRVVRVPRA
2100
2101
RTEPGGSQLV EQFTQQDLED GRLGLEVGRP EGRAPGPAGD SLTLELWAQG
2150
2151
VPPAVASLDF ATEPYNAARP YSVALLSVPE AARTEAGKPE SSTPTGEPGP
2200
2201
MASSPEPAVA KGGFLSFLEA NMFSVIIPMC LVLLLLALIL PLLFYLRKRN
2250
2251
KTGKHDVQVL TAKPRNGLAG DTETFRKVEP GQAIPLTAVP GQGPPPGGQP
2300
2301
DPELLQFCRT PNPALKNGQY WV                              
2322
 

Show the unformatted sequence.

Checksums:
CRC64:0B4F39AFC5ADD3CA
MD5:0a06c8b4afd18f0d460cac8fbecd3eae