Conclusion
Assessment
Binding Mode
Motif Status
Notes
Comments
Likely to be sequence specific TF
1 Monomer or homomultimer
No motif
(PMID: 26876175) has a ChIP-seq motif with GGGARWTGTAGT consensus. THAP11 is an atypical zinc-finger paralogous to THAP1, which has a crystal structure (PDB: 2KO0) where the protein contacts bases TGGGCA
Description
Description:
THAP domain containing 11 [Source:HGNC Symbol;Acc:HGNC:23194]
Entrez Summary
TBA
Ensembl ID:
ENSG00000168286
External Link:
T152763_1.02
Interpro
IPR006612 ; ;
Protein Domain:
Protein: ENSP00000304689DBD: THAPOther: CPSF100_C, DDHD, DUF2722
Previous Annotations
Source
Annotation
TF-CAT classification
No PMIDS:
Vaquerizas 2009 TF classification
"a " Has direct evidence of TF function;
"b " Has evidence for an orthologous TF;
"c " contains likely DBDs, but has no functional evidence;
"x " is an unlikely TF such as predicted gene, genes with likely non-specific DBDs or that have function outside transcription;
"other " category contains proteins without clear DBDs they curated from external sources.
c
CisBP considers it as a TF?
Yes
TFclass considers it as a TF?
Yes
Has GO:0003700 "transcription factor activity, sequence-specific DNA binding"
No
GO-Info
Initial Assessment
1a1 Protein has a high confidence PWM (HT-SELEX, PBM or B1H model) or there is a crystal structure that supports sequence specific DNA binding;
1a2 There is high confidence data for a close ortholog (as defined in CisBP);
2a1 There is lower confidence direct evidence, such as a Jaspar, Hocomoco or Transfac model;
2a2 There is lower confidence evidence for an close ortholog;
3a There is decent circumstantial evidence for its role as a TF or not;
4a Two or more datasets predict it as a TF;
5a One of the source datasets predicts is as a TF
4a, two or more datasets predict it as a TF
TF has conditional DNA-binding requirements
DNA-Binding
Published Motif Data
Structure
Experimental History
{"regions": [{"startStyle": "curved", "end": 138, "endStyle": "jagged", "aliStart": 6, "text": "THAP", "colour": "#009900", "aliEnd": 92, "start": 6, "href": "http://pfam.xfam.org/family/PF05485.10", "type": "pfama", "display": "true", "metadata": {"end": 138, "description": "The THAP domain is a putative DNA-binding domain (DBD) and probably also binds a zinc ion. It features the conserved C2CH architecture (consensus sequence: Cys - 2-4 residues - Cys - 35-50 residues - Cys - 2 residues - His). Other universal features include the location of the domain at the N-termini of proteins, its size of about 90 residues, a C-terminal AVPTIF box and several other conserved residues. Orthologues of the human THAP domain have been identified in other vertebrates and probably worms and flies, but not in other eukaryotes or any prokaryotes [1].", "database": "PfamA", "aliStart": 6, "scoreName": "E-value", "accession": "PF05485.10", "start": 6, "score": 1.8e-11, "identifier": "THAP domain", "type": "DBD", "aliEnd": 92}}, {"startStyle": "straight", "end": 223, "endStyle": "jagged", "aliStart": 104, "text": "DUF2722", "colour": "#9999ff", "aliEnd": 142, "start": 96, "href": "http://pfam.xfam.org/family/PF10846.6", "type": "pfama", "display": "true", "metadata": {"end": 223, "description": "This eukaryotic family of proteins has no known function.", "database": "PfamA", "aliStart": 104, "scoreName": "E-value", "accession": "PF10846.6", "start": 96, "score": 9.8e-05, "identifier": "Protein of unknown function (DUF2722)", "type": "DBD", "aliEnd": 142}}, {"startStyle": "jagged", "end": 289, "endStyle": "jagged", "aliStart": 101, "text": "DDHD", "colour": "#9999ff", "aliEnd": 157, "start": 36, "href": "http://pfam.xfam.org/family/PF02862.15", "type": "pfama", "display": "true", "metadata": {"end": 289, "description": "The DDHD domain is 180 residues long and contains four conserved residues that may form a metal binding site. The domain is named after these four residues. This pattern of conservation of metal binding residues is often seen in phosphoesterase domains. This domain is found in retinal degeneration B proteins, as well as a family of probable phospholipases. It has been shown that this domain is found in a longer C terminal region that binds to PYK2 tyrosine kinase. These proteins have been called N-terminal domain-interacting receptor (Nir1, Nir2 and Nir3) [1]. This suggests that this region is involved in functionally important interactions in other members of this family.", "database": "PfamA", "aliStart": 101, "scoreName": "E-value", "accession": "PF02862.15", "start": 36, "score": 0.0015, "identifier": "DDHD domain", "type": "DBD", "aliEnd": 157}}, {"startStyle": "jagged", "end": 172, "endStyle": "jagged", "aliStart": 104, "text": "CPSF100_C", "colour": "#9999ff", "aliEnd": 149, "start": 32, "href": "http://pfam.xfam.org/family/PF13299.4", "type": "pfama", "display": "true", "metadata": {"end": 172, "description": "This family lies at the C-terminus of many fungal and plant cleavage and polyadenylation specificity factor subunit 2 proteins. The exact function of the domain is not known, but is likely to function as a binding domain for the protein within the overall CPSF complex [1].", "database": "PfamA", "aliStart": 104, "scoreName": "E-value", "accession": "PF13299.4", "start": 32, "score": 0.0023, "identifier": "Cleavage and polyadenylation factor 2 C-terminal", "type": "DBD", "aliEnd": 149}}], "length": 315}