教育资源为主的文档平台

当前位置: 查字典文档网> 所有文档分类> 高等教育> 生物学> Canonical correlation analysis for RNA-seq co-expression networks

Canonical correlation analysis for RNA-seq co-expression networks

上传者:李颖哲
|
上传时间:2015-04-29
|
次下载

Canonical correlation analysis for RNA-seq co-expression networks

RNA-SEQ

Publishedonline4March2013NucleicAcidsResearch,2013,Vol.41,No.8e95

doi:10.1093/nar/gkt145

Canonical

co-expression典型的correlationanalysisforRNA-seqnetworks

ShengjunHong1,2,XiangningChen3,LiJin1andMomiaoXiong1,2,*

1StateKeyLaboratoryofGeneticEngineeringandMOEKeyLaboratoryofContemporaryAnthropology,SchoolofLifeSciencesandInstitutesofBiomedicalSciences,FudanUniversity,Shanghai200433,China,2DivisionofBiostatistics,HumanGeneticsCenter,UniversityofTexasSchoolofPublicHealth,Houston,TX77030,USAand3VirginiaInstituteforPsychiatricandBehavioralGenetics,VirginiaCommonwealthUniversity,Richmond,VA23298-0126,USA

ReceivedNovember28,2012;RevisedFebruary1,2013;AcceptedFebruary13,2013

ABSTRACTtheidenti?edgeneticvariationinassociationstudiesDigitaltranscriptomehasstillbeenlimited(1).Geneexpressionvariationmaysequencing转analysisbynext-generation

discovers录组substantialmRNAsigni?cantlycontributetophenotypevariation(2).Gene

Variationingeneexpression大量的variants.

underliesexpressionanalysesareimportantsourcestostudy

gicalprocessesandholds成manybiolo-

a为key基础tounravellingfunctionofgeneticvariationandareincreasingly

acquiringanimportantroleinunravellingmechanismofmechanismofcommondiseases.However,解决thecomplextraits.Therapidly解决

currentmethodsforconstructionsequencing特性developednext-generationtechnologieshavebeenbecomingthenetworksusingoverallgene构造ofco-expression

expressionareorigin-platformofchoiceforgeneexpressionpro?ling.

allydesignedformicroarrayexpressiondata,andRNA-seqforexpressionpro?lingofferscomprehensivetheyoverlookalargenumberofpictureoftranscriptome提供合的expressions.忽略variationsingene

Touseinformationonexon,genomicplatforms.RNA-seq转录组andissuperior

hasmadea优number秀的to综microarrayofsigni?cant

positionallevelandallele-specificexpressions,qualitativeandquantitative

develop位置wenovelcomponent-basedsion定性improvementsongeneexpres-

analysis的and定provides量的multiplelayersofresolutions

andbivariatecanonical组件methods,single

correlationanalysis,forcon-andtranscriptomecomplexity:theexpression分at辨exon,率

structionofco-expressionnetworkswithRNA-seqsingle-nucleotidepolymorphism(SNP)andpositionaldata.Toevaluatetheperformanceofourlevel;splicing;

forco-expression评估methodsnetworkinferencetheentire拼接post-transcriptionalRNAeditingacross

gene;isoformandallele-speci?cexpressions

(ASE)(1,3–5).data,theyareappliedtolungsquamous推论withRNA-seq

cellcancerVariationincomplexphenotypesisnotcausedbya

expressiondatafromTCGAdatabaseandoursinglegeneactingasamarker,butbyasetofinteracted两bipolardisorderandschizophreniagenesthatareoftenorganizedintovarioustypesofbio-The级的preliminary扰乱的results精神分裂症RNA-seqstudy.logicalnetworks(6).Geneco-expressionnetworksareco-expression初步的demonstratethatthenetworksconstructedbycanonicaloftenusedtoextractimportantinformationcorrelationanalysisandRNA-seqdataproviderichgroupsofco-regulated提取about

genesthatplayacentralrolein

geneticandmolecularinformationtogaininsightregulatoryprocesses.Co-expressionnetworksareabletointobiologicalprocessesanddiseasemechanism.洞察comprehensivelycapturetherelationshipsofindividualOurnewmethodssubstantiallyoutperformcomponentsofthe获transcriptome得perturbedbyenviron-currentstatisticalmethods大量的forco-expression胜过thements(7);hence,theyprovideapowerful扰乱tooltogainnetworkconstructionwithmicroarrayexpressionnewinsightsintothefunctionofgenes,biologicaldataorRNA-seqdatabasedonoverallgeneexpres-processes,theglobalstructureofthetranscriptomeand

mechanismofcomplexdiseases(6,8–11).

sionlevels.Traditionalstatisticalmethodsforconstructionof

co-expressionnetworks,suchasweightedco-expression

INTRODUCTIONnetworks,mutualinformationrelevance

anceselection共同andsparsegraphical关联networks,covari-

model,and共变异

Despitegreatprogressingeneticstudiesofcomplexcorrelationmethodsaremainly图解designedformicroarray局partial部的diseaseshasbeenmade,informationonthefunctionofexpressiondata(12–15).Allthesemethodsuseasingle*Towhomcorrespondenceshouldbeaddressed.Tel:+17135009894;Fax:+17135000900;Email:momiao.xiong@http://wendang.chazidian.com

ßTheAuthor(s)2013.PublishedbyOxfordUniversityPress.

ThisisanOpenAccessarticledistributedunderthetermsoftheCreativeCommonsAttributionNon-CommercialLicense(http://wendang.chazidian.com/licenses/by-nc/3.0/),whichpermitsunrestrictednon-commercialuse,distribution,andreproductioninanymedium,providedtheoriginalworkisproperlycited.Downloaded from http://wendang.chazidian.com/ at Harbin Medical Uniersity on May 1, 2013

RNA-SEQ

e95NucleicAcidsResearch,2013,Vol.41,No.8

valueofsummarizingstatistictorepresentgeneexpressionlevelandoverlookallinformationonexpressiondiffer-enceinexons,genomicpositionandalleles.Therefore,althoughRNA-seqdramaticallyincreasesthelevelofbio-logicaldetails(11),westillusethetraditionalstatisticalmethodsforco-expressionnetworkinference,whicharedesignedformicroarrayexpressiondataanddonotef?-cientlyusealloftheinformationcontainedinRNA-seqdata.Tofullyusethecomprehensiveinformationofthetranscriptomeandcaptureexpressionvariationatthelevelofexon,chromosomalposition,alleleandsplicingisoformswhichareprovidedbyRNA-seq,developmentofpowerfulcomputationaltoolsforexpressiondataanalysisishighlydesirable(16).

Inthisarticle,wedevelopcomputationalmethodstoaddresschallengesarisingfromco-expressionnetworkin-ferencewithRNA-seqdata.Toexploreobservedexpres-sionvariationinexonsoringenomicpositionacrossthegenes,weuseanationanalysis(CCA)普ordinary通的

singlevariatecanonicalcorrel-thatquanti?esthebetweenalinearcombinationof量the化

correlation

expressionsatexonlevelsorpositionlevelsinonegeneandanothersuchcom-binationofexpressionsinasecondgenetoconstructco-expressionnetworks.Speci?cally,theexpressionlevelateachexon,orexpressionlevelateachgenomicposition,willbeconsideredasvariables.Theexonexpressionsorgenomicpositional-levelexpressionsoftwogenesformtwolargesetsofvariables.Wewishtostudythoselinearcombinationsofvariablesmosthighlycorrelated.ThegoalofCCAistoseeklinearcombinationsoftwosetsofvariablesthatmaximizethecorrelationbetweentwosetsofvariables.Toachievethis,we?rstidentifythepairoflinearcombinationsthathavethelargestcorrel-ation.Next,weidentifythepairoflinearcombinationshavingthelargestcorrelationamongallpairsuncorrelatedwiththeinitiallyselectedpair,andsoon.Therefore,CCAmeasurestheco-expressionbetweentwogenesthatcantakegenomicpositionandallelelevelsofexpressionsintoconsideration.TomodelASE,wedevelopbivariateCCAtoconstructco-expressionnetworkswithASEdata,allowinglevelsofASEtovaryacrossSNPsandtoconsidercomplicatedpatternsofASEbecauseofallele-speci?csplicingandalternativetranscriptionstartsites(2).BivariateCCAconsidertwosetsofvectorsofmeasurements.Twoallele-speci?cexpressionsateachSNPformavectorfortheSNP.BivariateCCAistoseekafewlinearcombinationsofvectorswithtwoalleles’expressionsthathavethelargestcorrelations.Therefore,twovariateCCAmeasuretheco-expressionsbetweentwogenesthatcanconsiderexpressionsoftwoallelesateachSNP.ToevaluatetheperformanceofCCAforco-expressionnetworkinferencewithRNA-seqdata,theCCAforco-expressionnetworkconstructionisappliedtolungsquamouscellcancer(LUSC)expressiondatafromTCGAandabipolardisorderandschizophre-niaRNA-seqstudy.We?ndthatCCAforco-expressionnetworkconstructionwithRNA-seqdatasubstantially

outperformsthecurrentstatisticalmethods大量的

co-expression胜过

fornetworkconstructionwithmicroarrayexpressiondataoroverallgeneexpressiondata.Aprogramforimplementing实施

thedevelopedCCAfor

PAGE2OF15

co-expressionnetworkconstructioncanbedownloadedfrombioconductor(http://wendang.chazidian.com/)andourlocalwebsitehttp://www.sph.uth.tmc.edu/hgc/faculty/xiong/index.htm.MATERIALSANDMETHODSDataaccess

TheTCGARNA-seqdatasetsarepubliclyavailablefromtheTCGAwebsite(https://tcga-data.nci.nih.gov/tcga/).PathwaysareavailablefromKEGGdatabase(17,18)(http://www.genome.jp/kegg/http://wendang.chazidian.comAmethodforconstructionofgeneco-expressionnetworks

Ageneco-expressionnetworkisconsideredasanundir-ectedgraph,whereageneisrepresentedasanodeandeachedgeconnectingtwonodesisregardedastheco-expressionrelationshipofthetwoconnectedgenes.Constructionofco-expressionnetworksisoftencarriedoutbydetectingthepairwisecorrelationofgeneco-expression.TheCCAistoseekmaximizationofthecorrelationbetweentwolinearcombinationofthevariablesinthedatasets(19).Supposethatwehavepexonsorpositionsinonegeneandqexonsorpositions

inanothergene.Letxð1Þ

jdenotetheexpressionofthej-thexonorthenumberofreadsatthej-thgenomicposition

withinthe?rstgene.Wecansimilarlyde?nexð2Þ

forthesecondgene.LetXð1Þ¼½xð1Þ1,...,xð1ÞT

andj

Xð2Þ¼½xð2Þð2ÞT

p??

we1,...,xassumethatq??.Fortheconvenienceofpresenta-tion,p q.Let

X¼??Xð1Þ ??Æ

Xð2ÞandƼcovðX,XÞ¼Æ1112

Æ:21Æ22Constructionofco-expressionnetworksisimplementedbyseekingmaximizationofcorrelationÞcoef?cientsbetweenlinearcombinationU¼aTXð1forthe?rstgeneandlinearcombinationV¼bTXð2Þ:maxa,bcorrðU,VÞ¼aTÆ12b

aTÆ11abTÆ22b

ð1Þ

Solutionstotheoptimizationproblemeigenvalues??21!??2the2!ÁÁÁ!??2

(1)arethe

oftheRayleighpandtheircorrespondingeigenvectorsofquotientmatrix:

R¼ÆÀ1=2À1

À1=211Æ12Æ22Æ21Æ11:

InRNA-seqdata,weobserveeithermultipleexon

expressionsorsetsofnumberofreadsatgenomicpositionlevelsacrosstwogenes.Theexon-levelorgenomicpositional-levelexpressionsformtwosetsofvari-ablesortwovectorsofvariables.Canonicalcorrelationbetweentwogenesisto?ndthepairoflinearcombin-ationsofthevariablesdeterminedbyaandbsuchthattheircorrelationismaximized.The?rstpairoflinearcombinationiscalledthe?rstpairofcanonicalvariables.Theirlargestcorrelationiscalledthe?rstcanonicalcor-relations.Next,weidentifythepairoflinearcombinations

Downloaded from http://wendang.chazidian.com/ at Harbin Medical Uniersity on May 1, 2013

RNA-SEQ

PAGE3OF15

thathavethelargestcorrelationamongallpairsuncorrelatedwiththeinitiallyselectedpairandcalledthesecondpairofcanonicalvariables,andsoon.The?rstcanonicalcorrelationisequaltothesquarerootofthelargesteigenvalue??1ofthematrixR,thesecondcanonicalcorrelationisequaltothesquarerootofthelargesteigenvalue??2ofthematrixR,andsoon.LetebetheeigenvectorofthematrixRassociatedwiththejeigenvalue??j.Then,thevectorsofcoef?cientsaandbaregivenby

aj¼ÆÀ1

ÆÀ1=2À1=211Æ1222ej,andbj¼Æ22ej,j¼1,...,p:

Ordinarycorrelationcoef?cientcanonlymeasurethelinearrelationshipbetweentwovariables.Whenitisappliedtoquantifyco-expressionbetweentwogenes,exon-levelexpressionsorgenomicpositional-levelexpres-sionsneedtobeaggregatedintooverallexpressions.Theexon-levelandgenomicpositional-levelexpressionvariationinformationcannotbepreservedintheordinarycorrelation.Canonicalcorrelationisextensionofordinarycorrelationbetweentwovariablestocanonicalcorrelationbetweentwosetsofvariables.Therefore,canonicalcorrelationformeasuringco-expressionbetweentwogenesinRNA-seqdatacanmoreaccuratelyquantifythelinearrelationshipbetweentwogenesthantheordinarycorrelation.

LetPkbetheP-valuepoftheteststatistic(20)

Tk¼À½nÀðp+qÞ??P

logð1À??^2i¼k+1

iÞwithdistribution

2ðpÀkÞðqÀkÞ,wherenissamplesizefortestingthenull

hypothesisH0:??k¼...¼??p¼0.Weassignaweighttotheedgeconnectingtwogenes:

P

p??iIðlogPiÞ

w¼i¼1ð2Þ

IðlogPiÞ

i¼1

whereIðlogPÞ¼

??

0P>0:05

ÀlogPP 0:05

.Whenthedenom-inatoriszero,theweightiszero.

ThemethodfordeterminingthethresholdforretaininganedgebytheCCAmethodishardthresholdmethod(8).We?rstrankededgesbytheirweightsfromthelargesttothesmallest.Wethenselectedgesbypre-determinednumberofedgesorpercentageofedges.Inthisarticle,weselected5%ofedgeswithtopweights.

Theedgeswithranklargerthanthresholdareretainedinthenetwork.

Ingeneral,wehavemultiplecanonicalvariablesandcanonicalcorrelations.Tofullyusecanonicalcorrelationstocharacterizetherelationshipsbetweentwosetsofvariables,weintroducetheweightw.Thelargerthecontributiontotheedgeweight,thelargertheeigenvalueorcanonicalcorrelation.Ifthei-thcanonicalcorrelationisnotsigni?cant(P>0.05),itscontributiontotheedgeweightwillbesmall.Therefore,IðlogðpÞÞisgivenvalueof0.Theedgeweightde?nedbyEquation(2)canfullyusecanonicalcorrelationinformationtomeasurethedegreesofco-expressionoftwogenes.NucleicAcidsResearch,2013,Vol.41,No.8e95

BivariateCCAforconstructionofco-expressionnetworks

withASEdata

WedevelopnovelbivariateCCAforconstructionof

co-expressionnetworkswithASEdata.Letxð1Þjandxð2Þj

bethenumberofreadsofthemajorandminoralleleatthej-theSNPinthegene,respectively.Wecansimilarly

de?neyð1Þð2Þð1Þð2Þ

jandyjforanothergene.LetX¼½x1,x1,...,

xðp1Þ,xðð1Þð2Þp2Þ??TandY¼½y1,y1,...,yðq1Þ,yðq2Þ??T

.De?nelinear

combinationsU¼aTXandV¼bTY,wherea¼½??ð1Þ

1,??ð2Þ1,...,??ðp1Þ,??ðp2Þ??Tandb¼½??ð1Þð2Þ1,??1,...,??ðq1Þ,??ðq2Þ??T.Theselinearcombinationscanberewrittenas

U¼½??ð1Þ??TXð1Þ+½??ð2Þ??TXð2ÞandV¼½??ð1Þ??TYð1Þ+½??ð2Þ??TYð2Þ,

where??ð1Þ¼½??ð1Þ1,...,??ðp1Þ??T,??ð2Þ¼½??ð2Þ1,...,??ðp2Þ??T,??ð1Þ

¼½??ð1Þ,...,??ðq1Þ??T,??ð2Þ¼½??ð2Þ1,...,??ðq2Þ??T,Xð1Þ¼½xð1Þ11,...,

xðp1Þ??TXð2Þ¼½xð2Þ1,...xðp2Þ??T,Yð1Þ¼½yð1Þ1,...,yð1ÞT

andYð2Þ¼½yð2Þq??

1,...,yð2ÞT

De?nethecovarianceq??.

matrices:

Ƽ??Æ ÆxxÆ

Æxyyx

yy

,where

Æ??

Æxð1Þxð1ÞÆxð1Þxð2Þ

xx¼Æ,

xð2ÞxðÆ??1ÞÆxð2Þxð2Þ

Æð1Þð1ÞÆð1Þð2Þ

xy¼ÆTxyxy

yx¼ÆandðÆ??x2Þy

ð1Þ

Æxð2Þyð2ÞÆyð1Þyð1ÞÆyð1Þyð2Þ

yy¼

Æ:yð2Þyð1ÞÆyð2Þyð2Þ

TheCCAseekstomaximize

max??,??corrðU,VÞ¼??TÆxy??

ð3where??¼????TÆxx????TÆyy??

Þ??ð1Þ ????

ð1Þ ??ð2Þand??¼??ð2Þ

.Thesolutionstotheoptimizationproblem(3)aretheeigenvectorsofthematrixwiththeeigenvalues??1!??2!...!??2p:

R¼ÆÀ1=2À1À1=2

xxÆxyÆyyÆyxÆxx:

Ourformulationconsidersthecorrelationbetweenthe

expressionsoftwoalleles.Ifwedonottaketheircorrel-ationsintoaccount,thetwovariateCCAwillbecometwosinglevariateCCA.Again,letPTkbek¼À½nÀðp+qÞ??Pthe2pP-valueofthei¼k+1logð1À??^teststatistic(20)

2iÞ

withdistribution 2ð2pÀkÞð2qÀkÞ,wherenissamplesizefortestingthenullhypothesisH0:??k¼...¼??2p¼0.Weassignaweighttotheedgeconnectingtwogenes:

P

2p??iIðlogPiÞ

i¼1P2pð4Þ

IðlogPiÞ

i¼1

Downloaded from http://wendang.chazidian.com/ at Harbin Medical Uniersity on May 1, 2013

RNA-SEQ

e95NucleicAcidsResearch,2013,Vol.41,No.8PAGE4OF15whereIðlogPÞ¼??0P>0:05

ÀlogPP 0:05.Whenthedenom-SupplementaryFigureS1.Itwasclearlyshownthatthe

inatoriszero,theweightiszero.accuracyofCCAmethodforconstructionofco-SimilartosingleCCAforconstructionofco-expressionexpressionnetworkswasmuchhigherthanthatbynetworks,afterweranktheweights,wealsouserankGLASSOunderalldifferentnetworksizes.Thenweproceduretoprunethenetworks.studiednon-smallcelllungcancerpathwayinKEGG

withLUSCdataset.Afterdiscardingtheisolated

GraphicalLASSOinthepathwayandmatching抛弃nodes

totheTCGALUSC

RNA-seqlevel3data,weincluded44genesinthe

Sparseundirectedgraphicalmodelscanbeestimatedbyanalysis.Theconstructedco-expressionnetworksforthetheuseofL1(LASSO:leastabsoluteshrinkageandnon-smallcelllungcancerpathwaybytheCCAandselectionoperator)regularization(21).WeassumethatGLASSO(21)methodwereshowninFigure1.Edgestheoverallexpressionsofgeneshaveamultivariatewithredcolourwereintheco-expressionnetworkcon-Gaussiandistributionwithmean??andcovariancestructedonlybyCCA.EdgeswithbluecolourwereinmatrixÆ.Itisshowntheco-expressionnetworksconstructedbybothCCAinversematrixÆÀ1thatiftheijthecomponentoftheiszero,thenvariablesiandjareandGLASSO.Edgeswith蓝cyancolourwereintheconditionallyindependent,giventheothervariables.co-expressionnetworksconstructed绿色onlybyGLASSO.Therefore,co-expressionnetworkscanbeconstructedbyItconsistedoffourpathways:ErbBsignallingpathway,estimatingtheinverseofcovariancematrixÆÀ1throughMAPKsignallingpathway,PI3KpathwayandL细apoptosis胞1regularization.pathway.Figure1showedthatEGF(epidermalgrowth凋亡

factor)–EGFR–PI3K/Akt–apoptosissignal表皮pathwayand

MAPK(Raf–MEK–ERK)signalpathwaywereinthe

RESULTSco-expressionnetworkconstructedbytheCCAmethod.CanonicalcorrelationanalysisforconstructionofHowever,EGF–EGFR–PI3KpathwayandMAPKco-expressionnetworkswithexon-levelexpressiondatapathwayconnectionswerenotintheco-expressionAgeneco-expressionnetworkisrepresentedasanundir-networkconstructedbyGLASSO.Recentstudiesun-ectedgraph,whereeachnodedenotes未受指导的covered发现(22)thattheEGFstimulates刺激theproductionofpreciselyageneexpressionpro?le,表ageneormore

and示eachedgeconnect-白interleukin(IL)-8fromlungcancercells,whichinturn

ing精确two的nodesindicatessigni?cantco-expressionrelation-activates细胞介素EGFRandsignallingpathwayofPI3K/Akt.shipsofthetwogenes.ToexploreexonswithvaryingPI3K/Aktpathwayactivationplaysa重crucial要的roleinlungexpressioninformation,wedevelopedaCCA-basedcancerdevelopmentandproliferation.

wasinvolvedingenetranscription,增殖Raf–MEK–ERK

regulationofcellmethodforconstructionofco-expressionnetworks.Thesurvivalandangiogenesisandwasassociatedwithlungcanonicalcorrelationsmeasurethestrengthofassociationmetastasis(23).血管生成

betweenthetwosetsofexonexpression.ToillustratetheNetwork转移topologyplaysanimportantroleintheapplicationofCCAforconstructionofco-expressionfunctionandinformationprocessingofbiologicalnetworkswithexonexpressiondata,theCCAwasnetworks(24).AssortativityandcentralizationaretwoappliedtoLUSCRNA-seqdatasetfromTCGAimportanttopology协measures调集中化database.LUSCconsistsof242samples(225caseisapreference拓扑for学ofnetworks.Assortativityanetwork’snodestoattachtosamplesand17controlsamples).Here,weonlyusedatasimilar倾nodes.向other

Assortativityismeasuredbytheassort-belongingtothecasegroup.TheCCAmethodwasativitycoef?cientthatisde?nedasthePearsoncorrelationcomparedwithagraphicLASSO(GLASSO)(21)coef?cientofdegreebetweenpairsoflinkednodes(25).methodthatusedoverallgeneexpressiontoconstructIftheassortativitycoef?cientwaspositive,thenetworkco-expressionnetwork(fordetail,see‘Materialsandwassaidtobeassortative.Ontheotherhand,iftheMethods’section).Asimplerank-basedcut-offmethodassortativitycoef?cientwasnegative,thenetworkwaswasusedtoprunethenetwork(fordetail,see‘Materialsrecognizedasdisassortative.WeobservethatinandMethods’删section).减networks,highly异connected配的social

nodestendtobeconnectedWe?rstrandomlyselectedTheTCGARNA-seqlevel3withotherhighdegreenodes.Theassortativitycoef?cientdatawith50,200,400,600and1000genes,resultingininsocialnetworksis,hence,positive(25).However,?veRNA-seqdatasets.WethenappliedGLASSOandtechnologicalandbiologicalnetworkstypicallyshowCCAtothese?vesampledRNA-seqdatasetstoconstructthathigh-degreenodestendtoattachtolow-degreeco-expressionnetworksthatwerecalledtheoriginalnodes.Theirassortativitycoef?cientisnegative.For

co-expressionnetworksasthebasistoevaluatetheper-randomnetworks,theassortativitycoef?cienttendedtoformanceofCCAandGLASSOforconstructionofbenearlyzero(Wikipedia,thefreeencyclopedia).Thecen-co-expressionnetworks.Weusedbootstrapping

re-samplingRNA-seqdata1000timesfrom引导程fortralityofavertexeach序of?verelativeimportance顶点withinagraphthatdeterminesthe

ofavertexwithinthegraphisan

RNA-seqdatasets.TheCCAandGLASSOmethodsimportantconceptinnetworktheory.Centralizationiswereappliedtothere-sampledRNA-seqdatatoconstructbasedonthe概念conceptofcentrality.Itisde?nedasco-expressionnetworksfortestingtheaccuracy(howmeasuringthesumindifferencesincentralitybetweenmanyedgesintheoriginalnetworksarereservedinthethemostcentralnodeinanetworkandallothernodes.reconstructedco-expressionnetworksfrom保the留re-sampledItattemptstoquantifythelevelofanetworkabouthowRNA-seqdatasets).Theresultswereshownincentralizeditwasaroundparticularnodes(26).Table1Downloaded from http://wendang.chazidian.com/ at Harbin Medical Uniersity on May 1, 2013

RNA-SEQ

PAGE5OF15NucleicAcidsResearch,2013,Vol.41,No.8e95Figure1.Thesharednetworkstructurebynon-smalllungcancerpathwayinKEGGandreconstructedco-expressionnetworksusingtheCCAandGLASSOmethods.

Table1.Topologypropertyofco-expressionnetworksgeneratedbyexpressionswasmorebiologicallymeaningfulthanthatbyCCA,GLASSO,randomandKEGGoverallgeneexpressions.

MethodAssortativityCentralizationTofurtherevaluatetheperformanceoftheCCAfor

Mean(SD)Mean(SD)constructionofco-expressionnetworks,weappliedboth

CCAandGLASSOmethodstoanotherTCGARNA-seq

CCAÀ0.3937(0.0407)0.7606(0.0078)

GLASSOÀ0.0482(0.0868)0.6297(0.0458)dataset(uterinecorpusendometrioidcarcinoma)where

RandomÀ0.0489(0.0064)0.5666(0.0033)416casesamplesofUCECwereusedinthestudy.KEGGÀ0.23050.7257Thereconstructedpathwayfortheendometrial

bytheCCAandGLASSOmethod子宫were内膜cancer

shownin

SupplementaryFigureS2.Fromthis?gure,itwas

showedthatassortativityandcentralizationofshownthatonlytheCCAmethoddetectedtheMAPKco-expressionnetworksconstructedbytheCCA,(Raf–MEK–ERK)signalpathwayintheco-expressionGLASSO,randomselectionandtruenon-smallcelllungnetwork.Wealsopresentthenetworktopologycompari-cancerpathwayinKEGG,wherestandarddeviationsonoftworeconstructednetworkaswellasrandomcalculatedby1000re-sampling.Weobserved偏差was

thatnetworkandtrue

topologyoftheco-expressionnetworkconstructedbytheSupplementary子endometrioid

Table宫内膜carcinomapathwayin

S1.样的Thistableshowedthatthe

GLASSO(overallgeneexpression)wasclosetotheco-expressednetworkreconstructedbytheCCAmethodrandomnetwork;however,topologyoftheco-expressionwasclosertothepathwayinKEGG,butthetopologyofnetworkconstructedwasclosetothestructureoftrueco-expressionnetworkinferredbytheoverallexpressionsnon-smallcelllungcancerpathwayinKEGG.Table1andGLASSOwasmoresimilartotherandomone.indicatedthattheco-expressionnetworksconstructedby

RNA-seqexonexpressionswerehighlycentralizedandCCAforco-expressionnetworkconstructionwithdisassortative,whichwereinherenttopologyposition-levelRNA-seqdata

thebiologicalnetworks.This固有的featuresinfurtherdemonstratedthatThecurrentmethodsforco-expressionnetworkconstruc-theco-expressionnetworkconstructedbyRNA-seqexontionwithRNA-seqdataareto崩collapse溃倒塌the

内容需要下载文档才能查看

position-levelDownloaded from http://wendang.chazidian.com/ at Harbin Medical Uniersity on May 1, 2013

版权声明:此文档由查字典文档网用户提供,如用于商业用途请与作者联系,查字典文档网保持最终解释权!

下载文档

热门试卷

2016年四川省内江市中考化学试卷
广西钦州市高新区2017届高三11月月考政治试卷
浙江省湖州市2016-2017学年高一上学期期中考试政治试卷
浙江省湖州市2016-2017学年高二上学期期中考试政治试卷
辽宁省铁岭市协作体2017届高三上学期第三次联考政治试卷
广西钦州市钦州港区2016-2017学年高二11月月考政治试卷
广西钦州市钦州港区2017届高三11月月考政治试卷
广西钦州市钦州港区2016-2017学年高一11月月考政治试卷
广西钦州市高新区2016-2017学年高二11月月考政治试卷
广西钦州市高新区2016-2017学年高一11月月考政治试卷
山东省滨州市三校2017届第一学期阶段测试初三英语试题
四川省成都七中2017届高三一诊模拟考试文科综合试卷
2017届普通高等学校招生全国统一考试模拟试题(附答案)
重庆市永川中学高2017级上期12月月考语文试题
江西宜春三中2017届高三第一学期第二次月考文科综合试题
内蒙古赤峰二中2017届高三上学期第三次月考英语试题
2017年六年级(上)数学期末考试卷
2017人教版小学英语三年级上期末笔试题
江苏省常州西藏民族中学2016-2017学年九年级思想品德第一学期第二次阶段测试试卷
重庆市九龙坡区七校2016-2017学年上期八年级素质测查(二)语文学科试题卷
江苏省无锡市钱桥中学2016年12月八年级语文阶段性测试卷
江苏省无锡市钱桥中学2016-2017学年七年级英语12月阶段检测试卷
山东省邹城市第八中学2016-2017学年八年级12月物理第4章试题(无答案)
【人教版】河北省2015-2016学年度九年级上期末语文试题卷(附答案)
四川省简阳市阳安中学2016年12月高二月考英语试卷
四川省成都龙泉中学高三上学期2016年12月月考试题文科综合能力测试
安徽省滁州中学2016—2017学年度第一学期12月月考​高三英语试卷
山东省武城县第二中学2016.12高一年级上学期第二次月考历史试题(必修一第四、五单元)
福建省四地六校联考2016-2017学年上学期第三次月考高三化学试卷
甘肃省武威第二十三中学2016—2017学年度八年级第一学期12月月考生物试卷

网友关注视频

【部编】人教版语文七年级下册《老山界》优质课教学视频+PPT课件+教案,安徽省
苏科版数学七年级下册7.2《探索平行线的性质》
化学九年级下册全册同步 人教版 第22集 酸和碱的中和反应(一)
冀教版小学数学二年级下册第二单元《租船问题》
19 爱护鸟类_第一课时(二等奖)(桂美版二年级下册)_T3763925
沪教版八年级下册数学练习册20.4(2)一次函数的应用2P8
8.练习八_第一课时(特等奖)(苏教版三年级上册)_T142692
苏科版八年级数学下册7.2《统计图的选用》
第19课 我喜欢的鸟_第一课时(二等奖)(人美杨永善版二年级下册)_T644386
每天日常投篮练习第一天森哥打卡上脚 Nike PG 2 如何调整运球跳投手感?
化学九年级下册全册同步 人教版 第18集 常见的酸和碱(二)
北师大版数学四年级下册第三单元第四节街心广场
沪教版八年级下册数学练习册21.3(3)分式方程P17
冀教版小学英语四年级下册Lesson2授课视频
六年级英语下册上海牛津版教材讲解 U1单词
二年级下册数学第一课
外研版八年级英语下学期 Module3
3.2 数学二年级下册第二单元 表内除法(一)整理和复习 李菲菲
北师大版八年级物理下册 第六章 常见的光学仪器(二)探究凸透镜成像的规律
【部编】人教版语文七年级下册《泊秦淮》优质课教学视频+PPT课件+教案,辽宁省
二年级下册数学第三课 搭一搭⚖⚖
【部编】人教版语文七年级下册《过松源晨炊漆公店(其五)》优质课教学视频+PPT课件+教案,江苏省
沪教版牛津小学英语(深圳用)五年级下册 Unit 1
河南省名校课堂七年级下册英语第一课(2020年2月10日)
沪教版牛津小学英语(深圳用) 四年级下册 Unit 4
人教版历史八年级下册第一课《中华人民共和国成立》
第五单元 民族艺术的瑰宝_16. 形形色色的民族乐器_第一课时(岭南版六年级上册)_T3751175
第4章 幂函数、指数函数和对数函数(下)_六 指数方程和对数方程_4.7 简单的指数方程_第一课时(沪教版高一下册)_T1566237
二年级下册数学第二课
沪教版牛津小学英语(深圳用) 四年级下册 Unit 8