Browsing Expert Curation...

Need help?
Systematic Name Gene Name Motif ID Expert Confidence Dubious? Notes
V YEL009C GCN4 1363 High   Virtually all motifs look the same. MITOMI motif 1363 is as good as any of the ChIP-chip motifs but not circular; scores high across the board.
V YKR099W BAS1 402 High   Virtually all motifs are similar, with GAGTCA core. ChIP motif 402 has highest correspondence to both ChIP-chip and expression data.
V YPL216W   0   Dubious Unlikely to be true TF.
V YPL016W SWI1 0   Dubious Unlikely to be true TF.
V YOR308C SNU66 0   Dubious Unlikely to be true TF.
V YOR304W ISW2 0   Dubious Unlikely to be true TF.
V YOR290C SNF2 0   Dubious Unlikely to be true TF.
V YOR156C NFI1 0   Dubious Unlikely to be true TF.
V YNL227C JJJ1 0   Dubious Unlikely to be true TF.
V YNL079C TPM1 0   Dubious Unlikely to be true TF.
V YNL039W BDP1 0   Dubious Unlikely to be true TF.
V YNL023C FAP1 0   Dubious Unlikely to be true TF.
V YMR213W CEF1 0   Dubious Unlikely to be true TF.
V YMR176W ECM5 0   Dubious Unlikely to be true TF.
V YMR172W HOT1 0   Dubious Unlikely to be true TF.
V YLR254C NDL1 0   Dubious Unlikely to be true TF.
V YLR211C   0   Dubious Unlikely to be true TF.
V YJL176C SWI3 0   Dubious Unlikely to be true TF.
V YIL122W POG1 0   Dubious Unlikely to be true TF.
V YGR140W CBF2 0   Dubious Unlikely to be true TF.
V YGR071C   0   Dubious Unlikely to be true TF.
V YGR002C SWC4 0   Dubious Unlikely to be true TF.
V YGL133W ITC1 0   Dubious Unlikely to be true TF.
V YFR037C RSC8 0   Dubious Unlikely to be true TF.
V YER164W CHD1 0   Dubious Unlikely to be true TF.
V YER159C BUR6 0   Dubious Unlikely to be true TF.
V YER063W THO1 0   Dubious Unlikely to be true TF.
V YDR485C VPS72 0   Dubious Unlikely to be true TF.
V YDR448W ADA2 0   Dubious Unlikely to be true TF.
V YDR409W SIZ1 0   Dubious Unlikely to be true TF.
V YDR362C TFC6 0   Dubious Unlikely to be true TF.
V YDR323C PEP7 0   Dubious Unlikely to be true TF.
V YDR225W HTA1 0   Dubious Unlikely to be true TF.
V YDL074C BRE1 0   Dubious Unlikely to be true TF.
V YCR066W RAD18 0   Dubious Unlikely to be true TF.
V YCR033W SNT1 0   Dubious Unlikely to be true TF.
V YBR060C ORC2 0   Dubious Unlikely to be true TF.
V YBL052C SAS3 0   Dubious Unlikely to be true TF.
V YBL003C HTA2 0   Dubious Unlikely to be true TF.
V YOR172W YRM1 813 High   Two PBM studies largely agree on classic GAL4-class monomeric motif. Motif 813 has indications of spacing and orientation of dimeric protein.
V YPL230W USV1 509 High   Two PBM studies essentially agree on classical C2H2 GGGG-containing motif. Chose 509 because it scores much higher on both ChIP and expression data.
V YML027W YOX1 498 High   Two PBM studies and Pramila et al. (PMID 12464633) agree on classic homeodomain TAATTA motif. All three correlate with expression change and OE. Motif 453 is not a direct measurement so choose PBM motif that is the same length as the typical homeodomain footprint - 498 also correlates best with OE data; expression scores are skewed low by the large number of cell-cycle measurements.
V YIR013C GAT4 565 High   Two PBM motifs look similar, also similar to a subset of other GATAs. 565 scores higher on expression and OE data.
V YBL054W TOD6 852 High   Two PBM motifs largely agree; 852 has higher correspondence to expression data while 495 has higher correspondence to ChIP-chip. Use 852; score is way higher. Also for GO.
V YIL130W ASG1 2116 Medium   Two PBM motifs appear to represent monomeric and dimeric versions of the same motif. This is the monomeric version. No other supporting data; hence medium confidence. Picked 2116 because it has a higher GO score and expression score.
V YIL130W ASG1 807 Medium   Two PBM motifs appear to represent monomeric and dimeric versions of the same motif. This is the dimeric version. No other supporting data; hence medium confidence.
V YMR168C CEP3 524 High   Two PBM motifs agree. Went with 524 because it appears neater. No other supporting data for any of them.
V YLR098C CHA4 2120 High   Two PBM motifs agree, and PBM motif 2120 has highest correspondence to ChIP-chip data, even highter than the best ChIP-chip motif. Has a GAL4-like appearance, albeit a variant. Monomeric. (Highest scoring motif - 1607 - is actually a Rap1 motif).
V YKL222C   2192 High   Two motifs from PBMs resemble monomeric GAL4-like motif. 2192 agrees best with ChIP-chip data and expression data.
V YBR150C TBS1 552 High   Two motifs from PBMs are nearly identical GAL4-class motifs with defined spacing and orientation. Motif 552 has slightly higher scores. Two motifs from BEEML analysis of PBM data give monomeric motif - also give this high confidence.
V YBR150C TBS1 2179 High   Two motifs from PBMs are nearly identical GAL4-class motifs with defined spacing and orientation. Motif 552 has slightly higher scores. Two motifs from BEEML analysis of PBM data give monomeric motif - also give this high confidence.
V YBR239C ERT1 2188 Medium   Three PBM motifs are all classic monomeric GAL4 motifs. Chose 2188 because it has fewer noninformative flanking positions, and higher significance on expression data. Also, 826 has the CCGG core that I suspect may be an artefact of PBMs or the DBD clones used in these studies. The highest-scoring ChIP motif is circular and does not resemble a GAL4 class binding site.
V YOR337W TEA1 817 Medium   Three motifs, all from PBMs. Choose 817 because it has a more robust GAL4 "CGG" core. But there is no convincing corroborating data for either motif and they do not match each other.
V YLL054C   526 Medium   Three motifs available, from PBMs; two dimeric GAL4-like motifs but with different spacings and one monomeric. No backup data but looks tidy. Keep all three.
V YLL054C   2242 Medium   Three motifs available, from PBMs; two dimeric GAL4-like motifs but with different spacings and one monomeric. No backup data but looks tidy. Keep all three.
V YLL054C   816 Medium   Three motifs available, from PBMs; two dimeric GAL4-like motifs but with different spacings and one monomeric. No backup data but looks tidy. Keep all three.
V YGL166W CUP2 48 Medium   Three motifs account for three possible spacings in the literature motif; it is not clear that this is the optimal site, however
V YDR026C   696 High   Three ChIP-chip motifs are virtually identical in appearance; resemble Reb1 motifs; high correspondence to ChIP-chip data
V YPR086W SUA7 1327 Low Dubious This protein is not expected to bind DNA; it is supposed to bind DNA-bound TBP. The TIRF-PBM data used to generate the motif included only 96 sequences.
V YDR174W HMO1 2249 Low   This motif is uncharacteristic for a Sox protein and HMG proteins typically do not bind DNA in a sequence specific manner. Since it is from ChIP data it could be a cofactor motif. Low confidence.
V YJL127C SPT10 1880 Low   This is the protein that binds histone promoters. The sequence specificity is derived from the histone promoters only so the literature motif may be inaccurate. Motif 1880 has higher scores overall but does not resemble the literature motif. Uncertain what to do here - use 1880, but give low confidence. Motif learned in vivo could contain extrinsic information.
V YDR310C SUM1 478 High   This is the motif for the SUM1 AT_hook; scores highest in deletion expression data
V YDR310C SUM1 383 High   This is the motif for the FL SUM1; scores highest on ChIP-chip and resembles the canonical literature motif; also has some relationship to deletion expression data
V YER148W SPT15 798 High   This is TATA-binding protein. PBM motif 798 chosen because 1326 was derived from the 96-sequence TIRF-PBM array instead of a full 40K PBM
V YNR054C ESF2 0   Dubious This is supposed to be a ribosome biogenesis factor. I found no evidence that it is a sequence-specific DNA-binding protein.
V YNL132W KRE33 0   Dubious This is supposed to be a ribosome biogenesis factor. I found no evidence that it is a sequence-specific DNA-binding protein.
V YDL166C FAP7 0   Dubious This is supposed to be a ribosome biogenesis factor. I found no evidence that it is a sequence-specific DNA-binding protein.
V MAL63   136 Medium   This is an unconventional dimeric GAL4-class motif
V YPR186C PZF1 1321 Low   This is a single literature site. The protein almost certainly binds the site but it has not been demonstrated that this is an optimal binding site.
V YLR113W HOG1 0   Dubious This is a signalling molecule that associates with many TFs (see SGD)
V YDR520C URC2 553 High   This is a monomeric GAL4-class motif. Two PBM studies essentially agree, and have some relationship to ChIP-chip data. No other informative data.
V YER051W JHD1 662 Low Dubious This is a histone demethylase. No evidence for direct DNA binding. Motif 662 is significant. Include, but give low confidence - could be a cofactor.
V YBR240C THI2 1449 High   This is a GAL4-class protein. All motifs are ChIP-chip derived, none resembles each other. 1449 is the only one with respectable scores on ChIP and expression,and it also has the appearance of a GAL4 class motif..although, the structural prior presumably forces it to have this property.
V YOR298C-A MBF1 0   Dubious This is a coactivator. I found no evidence that it is a sequence-specific TF.
V YMR164C MSS11 204 Low Dubious There is no evidence that this is a sequence-specific DNA-binding protein, rather than a cofactor. The motif has a limited relationship to ChIP-chip data. The literature motif scores better than the motif derived from the ChIP-chip study. Also, the motif is identical to that for FLO8.
V YOR372C NDD1 0   Dubious There is no evidence that this is a sequence-specific DNA-binding protein, either in vitro or in vivo or in its sequence. The ChIP-chip motif that scores most highly is actually an MCM1 motif, which is consistent with the role of NDD1 as a "transcriptional activator essential for nuclear division".
V YMR075W RCO1 1066 Low Dubious There is no evidence that this is a sequence-specific DNA-binding protein rather than a chromatin factor. The higher-scoring ChIP-chip motif appears to have low information content and does not display strong correspondence to the data it was generated from or to expression data.
V YLR442C SIR3 0   Dubious There is no evidence that the SIR proteins are sequence-specific DNA-binding proteins. Most of the motifs for them are Rap1 sites.
V YKR101W SIR1 0   Dubious There is no evidence that the SIR proteins are sequence-specific DNA-binding proteins. Most of the motifs for them are Rap1 sites.
V YDR227W SIR4 0   Dubious There is no evidence that the SIR proteins are sequence-specific DNA-binding proteins. Most of the motifs for them are Rap1 sites.
V YDL042C SIR2 0   Dubious There is no evidence that the SIR proteins are sequence-specific DNA-binding proteins. Most of the motifs for them are Rap1 sites.
V YGR040W KSS1 0   Dubious There is no evidence that Kss1 is a sequence-specific TF.
V TBP-TFIIB TBP-TFIIB 1329 Medium   The TIRF-PBM data used to generate the motif included only 96 sequences; hence, medium confidence.
V TBP-TFIIA-TFIIB TBP-TFIIA-TFIIB 1330 Medium   The TIRF-PBM data used to generate the motif included only 96 sequences; hence, medium confidence.
V TBP-TFIIA TBP-TFIIA 1328 Low   The TIRF-PBM data used to generate the motif included only 96 sequences. Also it is curious that there is no TATA sequence in the logo.
V YDR213W UPC2 544 High   The SRE is bound by UPC2 and the "canonical" sequence is TCGTATA. However, the more degenerate version obtained by PBM (motif 544) scores better in both expression analysis and OE experiments. Newer motif 2109 scores better on ChIP-chip, but lower on expression, and the SRE is well-characterized....I think this one deserves further experimental analysis.
V YJR147W HMS2 992 Low   The one ChIP-chip motif bears little relationship to the ChIP data.it kind of looks like an HNF-like site, but still, low confidence.
V YNL167C SKO1 1401 High   The MITOMI motif 1401 is an offset and asymmetric version of the traditional consensus (TGACGTCA) but has a higher ChIP-chip and expression correspondence than the motifs that are more symmetric.
V YKL185W ASH1 28 Medium   The literature motif may not represent the full binding activity of the protein. Also, it is not supported by ChIP-chip. ChIP-chip identifies Mcm1-like motifs. But, it does score highly in both ChIP-chip and expression. The only higher-scoring motif has almost no information content.
V YML113W DAT1 1416 Medium   The literature (e.g. PMID: 8532535) suggests that the sequence specificity may be more promiscuous than the name suggests. To my knowledge there has not been any SELEX or PBM demonstrating that any motif is correct. But, it does bear some relationship to ChIP-chip and expression data.
V YLR182W SWI6 0   Dubious Swi6 is a cofactor, not a DNA-binding protein. These motifs are for Mbp1 or Swi4.
V YOR358W HAP5 695 High   Subunit of the heme-activated, glucose-repressed Hap2/3/4/5 CCAAT-binding complex - there should be a single motif for all four proteins, containing CCAAT. ChIP-chip motif 695 resembles CCAATCA, and scores highly on ChIP-chip, OE, and deletion expression data.
V YKL109W HAP4 695 High   Subunit of the heme-activated, glucose-repressed Hap2/3/4/5 CCAAT-binding complex - there should be a single motif for all four proteins, containing CCAAT. ChIP-chip motif 695 resembles CCAATCA, and scores highly on ChIP-chip, OE, and deletion expression data.
V YGL237C HAP2 695 High   Subunit of the heme-activated, glucose-repressed Hap2/3/4/5 CCAAT-binding complex - there should be a single motif for all four proteins, containing CCAAT. ChIP-chip motif 695 resembles CCAATCA, and scores highly on ChIP-chip, OE, and deletion expression data.
V YBL021C HAP3 695 High   Subunit of the heme-activated, glucose-repressed Hap2/3/4/5 CCAAT-binding complex - there should be a single motif for all four proteins, containing CCAAT. ChIP-chip motif 695 resembles CCAATCA, and scores highly on ChIP-chip, OE, and deletion expression data.
V YLR375W STP3 568 Medium   STP3 and 4 have very similar DNA-binding domains. However, they are not similar to those of STP1 and 2; the next most closely related are SWI5 and ACE2, with major differences in the recognition alpha helices. All of the STP4 motifs are different from each other and none have any supporting data. There is only one motif for STP3 (568) from PBM and it matches the STP4 motif from the same study (559) which is the basis for choosing these two motifs.
V YDL048C STP4 559 Medium   STP3 and 4 have very similar DNA-binding domains. However, they are not similar to those of STP1 and 2; the next most closely related are SWI5 and ACE2, with major differences in the recognition alpha helices. All of the STP4 motifs are different from each other and none have any supporting data. There is only one motif for STP3 (568) from PBM and it matches the STP4 motif from the same study (559) which is the basis for choosing these two motifs.
V YHR006W STP2 2174 High   STP1 and 2 have very similar DNA-binding domains. However, they are not similar to those of STP3 and 4. PBM motif for STP2 (2174) correlates highest with ChIP-chip and expression data. ChIP-chip motif for STP1 (660) most strongly resembles motif 800, and scores highly on ChIP-chip data. In addition, these motifs resemble halfmers of literature-derived binding sites.
V YDR463W STP1 660 High   STP1 and 2 have very similar DNA-binding domains. However, they are not similar to those of STP3 and 4. PBM motif for STP2 (800) correlates with ChIP-chip and expression data. ChIP-chip motif for STP1 (660) most strongly resembles motif 800, and scores highly on ChIP-chip data. In addition, these motifs resemble halfmers of literature-derived binding sites.
V YDR169C STB3 2233 High   STB3 binds RRPE element (AAAAATTT) both in vivo and in vitro (PMID 17616518). PBM motifs 810 and 2233 strongly resembles the RRPE element, scores significantly in deletion expression data, and nail the GO categories "nucleolus" and "ribosome biogenesis". 2233 gets slightly higher scores.

 Download displayed records: text csv html excel word Page: 1 of 4  Records: 354