TY - JOUR
T1 - Structures of CTCF-DNA complexes including all 11 zinc fingers
AU - Yang, Jie
AU - Horton, John R.
AU - Liu, Bin
AU - Corces, Victor G.
AU - Blumenthal, Robert M.
AU - Zhang, Xing
AU - Cheng, Xiaodong
N1 - Publisher Copyright:
© 2023 The Author(s).
PY - 2023/9/8
Y1 - 2023/9/8
N2 - The CCCTC-binding factor (CTCF) binds tens of thousands of enhancers and promoters on mammalian chromosomes by means of its 11 tandem zinc finger (ZF) DNA-binding domain. In addition to the 12-15-bp CORE sequence, some of the CTCF binding sites contain 5′ upstream and/or 3′ downstream motifs. Here, we describe two structures for overlapping portions of human CTCF, respectively, including ZF1-ZF7 and ZF3-ZF11 in complex with DNA that incorporates the CORE sequence together with either 3′ downstream or 5′ upstream motifs. Like conventional tandem ZF array proteins, ZF1-ZF7 follow the right-handed twist of the DNA, with each finger occupying and recognizing one triplet of three base pairs in the DNA major groove. ZF8 plays a unique role, acting as a spacer across the DNA minor groove and positioning ZF9-ZF11 to make cross-strand contacts with DNA. We ascribe the difference between the two subgroups of ZF1-ZF7 and ZF8-ZF11 to residues at the two positions -6 and -5 within each finger, with small residues for ZF1-ZF7 and bulkier and polar/charged residues for ZF8-ZF11. ZF8 is also uniquely rich in basic amino acids, which allows salt bridges to DNA phosphates in the minor groove. Highly specific arginine-guanine and glutamine-adenine interactions, used to recognize G:C or A:T base pairs at conventional base-interacting positions of ZFs, also apply to the cross-strand interactions adopted by ZF9-ZF11. The differences between ZF1-ZF7 and ZF8-ZF11 can be rationalized structurally and may contribute to recognition of high-affinity CTCF binding sites.
AB - The CCCTC-binding factor (CTCF) binds tens of thousands of enhancers and promoters on mammalian chromosomes by means of its 11 tandem zinc finger (ZF) DNA-binding domain. In addition to the 12-15-bp CORE sequence, some of the CTCF binding sites contain 5′ upstream and/or 3′ downstream motifs. Here, we describe two structures for overlapping portions of human CTCF, respectively, including ZF1-ZF7 and ZF3-ZF11 in complex with DNA that incorporates the CORE sequence together with either 3′ downstream or 5′ upstream motifs. Like conventional tandem ZF array proteins, ZF1-ZF7 follow the right-handed twist of the DNA, with each finger occupying and recognizing one triplet of three base pairs in the DNA major groove. ZF8 plays a unique role, acting as a spacer across the DNA minor groove and positioning ZF9-ZF11 to make cross-strand contacts with DNA. We ascribe the difference between the two subgroups of ZF1-ZF7 and ZF8-ZF11 to residues at the two positions -6 and -5 within each finger, with small residues for ZF1-ZF7 and bulkier and polar/charged residues for ZF8-ZF11. ZF8 is also uniquely rich in basic amino acids, which allows salt bridges to DNA phosphates in the minor groove. Highly specific arginine-guanine and glutamine-adenine interactions, used to recognize G:C or A:T base pairs at conventional base-interacting positions of ZFs, also apply to the cross-strand interactions adopted by ZF9-ZF11. The differences between ZF1-ZF7 and ZF8-ZF11 can be rationalized structurally and may contribute to recognition of high-affinity CTCF binding sites.
UR - http://www.scopus.com/inward/record.url?scp=85170294232&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85170294232&partnerID=8YFLogxK
U2 - 10.1093/nar/gkad594
DO - 10.1093/nar/gkad594
M3 - Article
C2 - 37439339
AN - SCOPUS:85170294232
SN - 0305-1048
VL - 51
SP - 8447
EP - 8462
JO - Nucleic acids research
JF - Nucleic acids research
IS - 16
ER -