
GSEA: Principles, Running the Software, and Common Errors with Solutions

Medical student (other disciplines) · Posted 2018-07-31 · IP: Beijing


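Part 1: How GSEA Works

Goal: determine whether the members of a predefined gene set S are randomly distributed throughout the ranked gene list L, or concentrated at its top or bottom.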

1. Expression profiles: samples are divided into two classes, labeled 1 or 2

GSEA considers experiments with genomewide expression profiles from samples belonging to two classes, labeled 1 or 2.

2. Rank the genes by the correlation between their expression and the class distinction

Genes are ranked based on the correlation between their expression and the class distinction by using any suitable metric.
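To make the ranking step concrete, here is a minimal sketch. The quoted text allows "any suitable metric"; this sketch assumes a plain signal-to-noise score, (mean1 - mean2) / (sd1 + sd2), without any of the variance adjustments a real tool may apply. The function names and the toy expression values are made up for illustration.

```python
from statistics import mean, stdev

def signal_to_noise(values, labels):
    """Score one gene as (mean1 - mean2) / (sd1 + sd2) across the two classes."""
    class1 = [v for v, c in zip(values, labels) if c == 1]
    class2 = [v for v, c in zip(values, labels) if c == 2]
    # Assumes each class has at least two samples and the standard deviations are not both zero.
    return (mean(class1) - mean(class2)) / (stdev(class1) + stdev(class2))

def rank_genes(expression, labels):
    """Return (gene, score) pairs, highest score (most class-1-associated) first."""
    scores = {gene: signal_to_noise(vals, labels) for gene, vals in expression.items()}
    return sorted(scores.items(), key=lambda pair: pair[1], reverse=True)

# Toy data (invented): three class-1 samples followed by three class-2 samples.
labels = [1, 1, 1, 2, 2, 2]
expression = {
    "geneA": [9.0, 10.0, 11.0, 1.0, 2.0, 3.0],  # higher in class 1
    "geneB": [5.0, 6.0, 4.0, 5.0, 4.0, 6.0],    # no class difference
    "geneC": [1.0, 2.0, 3.0, 9.0, 10.0, 11.0],  # higher in class 2
}
ranked = rank_genes(expression, labels)
print(ranked)
```

Genes associated with class 1 end up at the top of the list and genes associated with class 2 at the bottom, which is the ordering the enrichment score walks through in the next step.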

3. Compute the enrichment score (ES)

Given an a priori defined set of genes S (e.g., genes encoding products in a metabolic pathway, located in the same cytogenetic band, or sharing the same GO category), the goal of GSEA is to determine whether the members of S are randomly distributed throughout L or primarily found at the top or bottom. We expect that sets related to the phenotypic distinction will tend to show the latter distribution.

Step 1: Calculation of an Enrichment Score.

We calculate an enrichment score (ES) that reflects the degree to which a set S is overrepresented at the extremes (top or bottom) of the entire ranked list L.

The score is calculated by walking down the list L, increasing a running-sum statistic when we encounter a gene in S and decreasing it when we encounter genes not in S.

The magnitude of the increment depends on the correlation of the gene with the phenotype. The enrichment score is the maximum deviation from zero encountered in the random walk; it corresponds to a weighted Kolmogorov–Smirnov-like statistic.
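Written out, the random walk described above can be formalized as follows. This follows the definition in the PNAS paper listed in the references; the symbols are: N genes in the ranked list L, N_H genes in S, r_j the correlation of gene g_j with the phenotype, and p an exponent that controls the weighting (p = 0 reduces to the classic Kolmogorov–Smirnov statistic).

```latex
\begin{aligned}
N_R &= \sum_{g_j \in S} |r_j|^p \\
P_{\mathrm{hit}}(S, i) &= \sum_{\substack{g_j \in S \\ j \le i}} \frac{|r_j|^p}{N_R} \\
P_{\mathrm{miss}}(S, i) &= \sum_{\substack{g_j \notin S \\ j \le i}} \frac{1}{N - N_H} \\
ES(S) &= P_{\mathrm{hit}}(S, i^*) - P_{\mathrm{miss}}(S, i^*),
\quad i^* = \arg\max_i \left| P_{\mathrm{hit}}(S, i) - P_{\mathrm{miss}}(S, i) \right|
\end{aligned}
```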

In short, the ES is the maximum deviation from zero of a running-sum statistic.
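The walk down the ranked list can be sketched in a few lines. This is an illustrative implementation of the running sum as described in the quoted text, not the GSEA software's own code; the function name, the exponent parameter `p`, and the toy scores are assumptions made for the example.

```python
def enrichment_score(ranked_genes, scores, gene_set, p=1.0):
    """Walk down the ranked list and return the signed maximum deviation from zero.

    ranked_genes: gene names, already sorted by their correlation score
    scores:       the correlation scores, in the same order
    gene_set:     the predefined set S
    p:            weighting exponent (p=0 gives equal steps for every hit)
    """
    in_set = [gene in gene_set for gene in ranked_genes]
    n_hits = sum(in_set)
    n_miss = len(ranked_genes) - n_hits
    if n_hits == 0 or n_miss == 0:
        raise ValueError("gene set must overlap the list without covering it entirely")
    hit_total = sum(abs(s) ** p for s, hit in zip(scores, in_set) if hit)
    if hit_total == 0:
        raise ValueError("all genes in the set have a zero score")

    running, es = 0.0, 0.0
    for s, hit in zip(scores, in_set):
        if hit:
            running += abs(s) ** p / hit_total  # step up, weighted by correlation
        else:
            running -= 1.0 / n_miss             # step down by a constant amount
        if abs(running) > abs(es):
            es = running                        # track the largest deviation so far
    return es

# Toy ranked list (invented scores).
genes = ["g1", "g2", "g3", "g4", "g5", "g6"]
scores = [3.0, 2.0, 1.0, -1.0, -2.0, -3.0]
es_top = enrichment_score(genes, scores, {"g1", "g2"})     # set sits at the top
es_bottom = enrichment_score(genes, scores, {"g5", "g6"})  # set sits at the bottom
print(es_top, es_bottom)
```

A set concentrated at the top of the list gives a large positive ES, and a set concentrated at the bottom gives a large negative ES; a set scattered through the list stays close to zero.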

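4. Assess the significance of the ES (p-value)

Significance is estimated by permutation; the number of permutations is configurable, for example 1000 or 500.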
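A rough sketch of the permutation test: shuffle the class labels, re-rank the genes, recompute the ES, and see how often the shuffled ES is at least as extreme as the observed one. To keep the example short it assumes a simple difference of class means as the ranking metric and compares against the null scores that share the sign of the observed ES. All function names and the toy data are invented for illustration, and the code assumes the gene set overlaps the list without covering it.

```python
import random
from statistics import mean

def rank_genes(expression, labels):
    """Rank genes by mean(class 1) - mean(class 2), highest first (stand-in metric)."""
    def score(values):
        class1 = [v for v, c in zip(values, labels) if c == 1]
        class2 = [v for v, c in zip(values, labels) if c == 2]
        return mean(class1) - mean(class2)
    return sorted(((gene, score(vals)) for gene, vals in expression.items()),
                  key=lambda pair: pair[1], reverse=True)

def enrichment_score(ranked, gene_set, p=1.0):
    """Signed maximum deviation from zero of the weighted running sum."""
    n_hits = sum(1 for gene, _ in ranked if gene in gene_set)
    n_miss = len(ranked) - n_hits
    hit_total = sum(abs(s) ** p for gene, s in ranked if gene in gene_set)
    running = es = 0.0
    for gene, s in ranked:
        if gene in gene_set:
            # Fall back to equal steps if every set gene has a zero score.
            running += abs(s) ** p / hit_total if hit_total > 0 else 1.0 / n_hits
        else:
            running -= 1.0 / n_miss
        if abs(running) > abs(es):
            es = running
    return es

def permutation_p_value(expression, labels, gene_set, n_perm=1000, seed=0):
    """Return (observed ES, nominal p-value) using label permutations."""
    rng = random.Random(seed)
    observed = enrichment_score(rank_genes(expression, labels), gene_set)
    shuffled = list(labels)
    null = []
    for _ in range(n_perm):
        rng.shuffle(shuffled)  # break the link between samples and classes
        null.append(enrichment_score(rank_genes(expression, shuffled), gene_set))
    # Compare only against null scores with the same sign as the observed ES.
    same_sign = [e for e in null if (e >= 0) == (observed >= 0)]
    extreme = sum(1 for e in same_sign if abs(e) >= abs(observed))
    return observed, (extreme / len(same_sign) if same_sign else 0.0)

# Toy data (invented): 16 background genes plus 4 set genes that are high in class 1.
data_rng = random.Random(1)
labels = [1, 1, 1, 2, 2, 2]
expression = {f"bg{i}": [data_rng.gauss(5.0, 1.0) for _ in labels] for i in range(16)}
for i in range(4):
    expression[f"set{i}"] = [8.0 + i, 9.0 + i, 10.0 + i, 1.0, 2.0, 3.0]
gene_set = {f"set{i}" for i in range(4)}

observed_es, p_value = permutation_p_value(expression, labels, gene_set, n_perm=500)
print(observed_es, p_value)
```

With only six samples there are few distinct label arrangements, so the p-value here is coarse; more permutations give a finer estimate only when the sample size allows it.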

5. Correct for multiple hypothesis testing (FDR)
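One way to sketch this step, following the approach described in the PNAS paper: normalize each ES by the mean of its same-signed permutation scores (giving an NES), then estimate the FDR for a set as the fraction of null NES values at least as extreme, divided by the fraction of observed NES values at least as extreme. This is a simplified illustration under those assumptions, not the GSEA software's implementation; the function names and all the numbers are invented.

```python
from statistics import mean

def normalize(es, null_es):
    """Divide an ES by the mean magnitude of the same-signed null scores for its gene set."""
    same_sign = [abs(e) for e in null_es if (e >= 0) == (es >= 0)]
    scale = mean(same_sign) if same_sign else 0.0
    # Edge case (assumption of this sketch): leave the ES unscaled if no usable null scores exist.
    return es / scale if scale > 0 else es

def fdr_q_values(observed, null):
    """observed: {set name: ES}; null: {set name: [ES from each permutation]}."""
    nes = {name: normalize(es, null[name]) for name, es in observed.items()}
    null_nes = [normalize(e, null[name]) for name in observed for e in null[name]]
    q_values = {}
    for name, value in nes.items():
        positive = value >= 0
        # Use only the side of each distribution that matches the sign of this NES.
        null_side = [x for x in null_nes if (x >= 0) == positive]
        obs_side = [x for x in nes.values() if (x >= 0) == positive]
        null_frac = (sum(1 for x in null_side if abs(x) >= abs(value)) / len(null_side)
                     if null_side else 0.0)
        obs_frac = sum(1 for x in obs_side if abs(x) >= abs(value)) / len(obs_side)
        q_values[name] = min(1.0, null_frac / obs_frac)
    return nes, q_values

# Invented ES values for three gene sets and four permutations each.
observed = {"setA": 0.8, "setB": 0.3, "setC": -0.5}
null = {
    "setA": [0.2, 0.3, -0.2, 0.25],
    "setB": [0.3, 0.4, -0.3, 0.2],
    "setC": [-0.4, 0.3, -0.6, 0.2],
}
nes, q_values = fdr_q_values(observed, null)
print(nes)
print(q_values)
```

Normalizing first matters because gene sets of different sizes produce ES values on different scales, so raw scores from different sets are not directly comparable.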

References:

Subramanian A, et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. PNAS, 2005. http://www.pnas.org/content/102/43/15545

https://blog.csdn.net/qq_29300341/article/details/52956052
