关于 MACS call ATAC-seq 数据 Peak 时候的激烈讨论(MACS3 正在开发中)

Twitter 热论:ATAC peak calling with MACS2 问题

MACS3 github专门开放了一个征对 ATAC-seq call peak 的讨论模块:https://github.com/macs3-project/MACS/discussions/435

image.png
image.png

liu tao 学生 HMMRATAC 开发者不同意 xi chen 说法:https://twitter.com/epigeneticsnerd/status/1337081681141002240

So if you are going to use MACS1/2 for ATAC-seq, and I will instead recommend using HMMRATAC or MACS3 (which will use the **HMMRATAC **algorithm for **ATAC **analysis), then the best settings are -BAMPE --call-summits.

image.png
image.png

To start, a disclosure, I was @fooliu grad student and** together we developed HMMRATAC**. As part of that development, I spent countless hours running MACS with various settings.

One unique feature of ATAC-seq, as opposed to ChIP or DNase-seq, is that the transposase can insert into the linker regions between adjacent nucleosomes.

This creates a library containing fragments of various sizes, corresponding to transposase insertion into nucleosome-free regions, and across nucleosome arrays.

We also found that true nucleosome-free regions are marked by an enrichment of short fragments flanked by nucleosome-sized fragments.False positive sites are often called because they contain an enrichment of just one sized fragments (say those <100bp etc)

What happens then, is single end data, or data forced to be single end, calls more false positives than the properly paired data. Using MACS without -BAMPE haslower precision than when using -BAMPE. The HMMRATAC paper has a figure showing this.

However, i understand the reasoning behind this. With ATAC-seq, many times a researcher is interested in the TF binding site. And with ATAC-seq, this means finding the "footprint". A footprint should enrich for cutting sites, hence the focus on cut sites.

But again, many of those cutting sites are in the linker regions. So if you call peaks using only cut sites, you will end up looking at a lot of linkers.

Instead, you should seek to call the most accurate peaks, with the BAMPE settings, THEN look for the footprint. So how do you do that? With MACS, use --call-summits

Because, as it turns out, the summit of an ATAC peak is the footprint. We also show this in the **HMMRATAC **paper.

So if you are going to use MACS1/2 for ATAC-seq, and I will instead recommend using HMMRATAC or MACS3 (which will use the HMMRATAC algorithm for ATAC analysis), then the best settings are -BAMPE --call-summits.

ATAC peak calling with MACS2
认为加上参数 **-f BED --shift -100 --extsize 200** 更好。

image
image

ATAC peak calling with MACS2: I know this is a recurring problem but I got asked a few times recently, so I just put all info here for my own ref. We now routinely convert paired BAM to simple BED file and use "-f BED --shift -100 --extsize 200" for the peak calling. Why? 1/7

Like many other peak callers, MACS2 is for ChIP. In ChIP,the reads are flanking the actual binding of the protein. Therefore, many peak callers shift or extend reads towards the mid. of the fragments to reflect the actual binding. MCS2 uses extension, the "--ext" flag. 2/7

image
image

In ATAC/DNase, the mid of the fragments is not really what we are interested in. Instead, we are interested in the blue and red dots in the fig. which are the cutting sites of the enzyme. Those dots are the start (5' end) of your reads, so default of MACS2 doesn't fit here 3/7
image.png
image.png

Instead, we want to call peaks with fragments centred on the 5' end of your reads. We had this discuss about 6 years ago in the MACS2 google usergroup. Both
@fooliu and @anshulkundaje provided excellent info. Check this link if haven't seen it before: https://groups.google.com/g/macs-announcement/c/4OCE59gkpKY/m/v9Tnh9jWriUJ… 4/7

An update of **MACS2 **(ver 2.1.0 20140616) was made after the discussion, and you could freely manipulate the read positions with the combination of the "--shift" and the "--extsize" flags. Then, why covert paired BAM to simple BED to use "-f BED" ? 5/

According to Issue #145 from the MACS2 GitHub https://github.com/macs3-project/MACS/issues/145… . When you set "-f BAM" or "-f BAMPE", MACS2 only takes the left read, ignoring the other. However in ATAC the 5' end of both R1 and R2 are of interest. Convert to BED will solve this problem, I guess?. 6/7

  • Hi very interesting, thank you! I notice that ENCODE3 and others use** --nomodel --shift -37 --extsize 73**. Do you have any comments on this? Thanks
  • We use 200 due to habit. The choice is arbitrary. Not sure how diff it makes. Intuitively smaller fragLen give better res. but it can’t be too short. Check Fig1 in this ref (not ATAC though) https://ncbi.nlm.nih.gov/pmc/articles/PMC2596141/… 75 gives better res. Maybe that’s the reason ENCODE use it.

Not sure if all those modifications will make huge difference, but the results are different. I haven't tried HMMRATAC, Genrich, MACS3 etc. yet. Will do in future. Hope this helps to those who are new to this type of analysis. 7/7

到底哪种更好?希望大家能够积极讨论,也欢迎去 MACS3 社区积极谈论自己的看法。
https://github.com/macs3-project/MACS/discussions

我是搬运工,我负责搬运,你负责参考吸收,不对的别喷我,你如果说我搬运,我也可以选择不搬运,反正我这又不是什么盈利性质的,但是我只是觉得想将自己的所见所闻分享给大家,希望能被有需要的人看到,只要有一个能受到帮助,就够了。

©著作权归作者所有,转载或内容合作请联系作者
  • 序言:七十年代末,一起剥皮案震惊了整个滨河市,随后出现的几起案子,更是在滨河造成了极大的恐慌,老刑警刘岩,带你破解...
    沈念sama阅读 193,968评论 5 459
  • 序言:滨河连续发生了三起死亡事件,死亡现场离奇诡异,居然都是意外死亡,警方通过查阅死者的电脑和手机,发现死者居然都...
    沈念sama阅读 81,682评论 2 371
  • 文/潘晓璐 我一进店门,熙熙楼的掌柜王于贵愁眉苦脸地迎上来,“玉大人,你说我怎么就摊上这事。” “怎么了?”我有些...
    开封第一讲书人阅读 141,254评论 0 319
  • 文/不坏的土叔 我叫张陵,是天一观的道长。 经常有香客问我,道长,这世上最难降的妖魔是什么? 我笑而不...
    开封第一讲书人阅读 52,074评论 1 263
  • 正文 为了忘掉前任,我火速办了婚礼,结果婚礼上,老公的妹妹穿的比我还像新娘。我一直安慰自己,他们只是感情好,可当我...
    茶点故事阅读 60,964评论 4 355
  • 文/花漫 我一把揭开白布。 她就那样静静地躺着,像睡着了一般。 火红的嫁衣衬着肌肤如雪。 梳的纹丝不乱的头发上,一...
    开封第一讲书人阅读 46,055评论 1 272
  • 那天,我揣着相机与录音,去河边找鬼。 笑死,一个胖子当着我的面吹牛,可吹牛的内容都是我干的。 我是一名探鬼主播,决...
    沈念sama阅读 36,484评论 3 381
  • 文/苍兰香墨 我猛地睁开眼,长吁一口气:“原来是场噩梦啊……” “哼!你这毒妇竟也来了?” 一声冷哼从身侧响起,我...
    开封第一讲书人阅读 35,170评论 0 253
  • 序言:老挝万荣一对情侣失踪,失踪者是张志新(化名)和其女友刘颖,没想到半个月后,有当地人在树林里发现了一具尸体,经...
    沈念sama阅读 39,433评论 1 290
  • 正文 独居荒郊野岭守林人离奇死亡,尸身上长有42处带血的脓包…… 初始之章·张勋 以下内容为张勋视角 年9月15日...
    茶点故事阅读 34,512评论 2 308
  • 正文 我和宋清朗相恋三年,在试婚纱的时候发现自己被绿了。 大学时的朋友给我发了我未婚夫和他白月光在一起吃饭的照片。...
    茶点故事阅读 36,296评论 1 325
  • 序言:一个原本活蹦乱跳的男人离奇死亡,死状恐怖,灵堂内的尸体忽然破棺而出,到底是诈尸还是另有隐情,我是刑警宁泽,带...
    沈念sama阅读 32,184评论 3 312
  • 正文 年R本政府宣布,位于F岛的核电站,受9级特大地震影响,放射性物质发生泄漏。R本人自食恶果不足惜,却给世界环境...
    茶点故事阅读 37,545评论 3 298
  • 文/蒙蒙 一、第九天 我趴在偏房一处隐蔽的房顶上张望。 院中可真热闹,春花似锦、人声如沸。这庄子的主人今日做“春日...
    开封第一讲书人阅读 28,880评论 0 17
  • 文/苍兰香墨 我抬头看了看天上的太阳。三九已至,却和暖如春,着一层夹袄步出监牢的瞬间,已是汗流浃背。 一阵脚步声响...
    开封第一讲书人阅读 30,150评论 1 250
  • 我被黑心中介骗来泰国打工, 没想到刚下飞机就差点儿被人妖公主榨干…… 1. 我叫王不留,地道东北人。 一个月前我还...
    沈念sama阅读 41,437评论 2 341
  • 正文 我出身青楼,却偏偏与公主长得像,于是被迫代替她去往敌国和亲。 传闻我的和亲对象是个残疾皇子,可洞房花烛夜当晚...
    茶点故事阅读 40,630评论 2 335

推荐阅读更多精彩内容