Canonical correlation analysis for RNA-seq co-expression networks
上传者:李颖哲|上传时间:2015-04-29|密次下载
Canonical correlation analysis for RNA-seq co-expression networks
RNA-SEQ
Publishedonline4March2013NucleicAcidsResearch,2013,Vol.41,No.8e95
doi:10.1093/nar/gkt145
Canonical
co-expression典型的correlationanalysisforRNA-seqnetworks
ShengjunHong1,2,XiangningChen3,LiJin1andMomiaoXiong1,2,*
1StateKeyLaboratoryofGeneticEngineeringandMOEKeyLaboratoryofContemporaryAnthropology,SchoolofLifeSciencesandInstitutesofBiomedicalSciences,FudanUniversity,Shanghai200433,China,2DivisionofBiostatistics,HumanGeneticsCenter,UniversityofTexasSchoolofPublicHealth,Houston,TX77030,USAand3VirginiaInstituteforPsychiatricandBehavioralGenetics,VirginiaCommonwealthUniversity,Richmond,VA23298-0126,USA
ReceivedNovember28,2012;RevisedFebruary1,2013;AcceptedFebruary13,2013
ABSTRACTtheidenti?edgeneticvariationinassociationstudiesDigitaltranscriptomehasstillbeenlimited(1).Geneexpressionvariationmaysequencing转analysisbynext-generation
discovers录组substantialmRNAsigni?cantlycontributetophenotypevariation(2).Gene
Variationingeneexpression大量的variants.
underliesexpressionanalysesareimportantsourcestostudy
gicalprocessesandholds成manybiolo-
a为key基础tounravellingfunctionofgeneticvariationandareincreasingly
acquiringanimportantroleinunravellingmechanismofmechanismofcommondiseases.However,解决thecomplextraits.Therapidly解决
currentmethodsforconstructionsequencing特性developednext-generationtechnologieshavebeenbecomingthenetworksusingoverallgene构造ofco-expression
expressionareorigin-platformofchoiceforgeneexpressionpro?ling.
allydesignedformicroarrayexpressiondata,andRNA-seqforexpressionpro?lingofferscomprehensivetheyoverlookalargenumberofpictureoftranscriptome提供合的expressions.忽略variationsingene
Touseinformationonexon,genomicplatforms.RNA-seq转录组andissuperior
hasmadea优number秀的to综microarrayofsigni?cant
positionallevelandallele-specificexpressions,qualitativeandquantitative
develop位置wenovelcomponent-basedsion定性improvementsongeneexpres-
analysis的and定provides量的multiplelayersofresolutions
andbivariatecanonical组件methods,single
correlationanalysis,forcon-andtranscriptomecomplexity:theexpression分at辨exon,率
structionofco-expressionnetworkswithRNA-seqsingle-nucleotidepolymorphism(SNP)andpositionaldata.Toevaluatetheperformanceofourlevel;splicing;
forco-expression评估methodsnetworkinferencetheentire拼接post-transcriptionalRNAeditingacross
gene;isoformandallele-speci?cexpressions
(ASE)(1,3–5).data,theyareappliedtolungsquamous推论withRNA-seq
cellcancerVariationincomplexphenotypesisnotcausedbya
expressiondatafromTCGAdatabaseandoursinglegeneactingasamarker,butbyasetofinteracted两bipolardisorderandschizophreniagenesthatareoftenorganizedintovarioustypesofbio-The级的preliminary扰乱的results精神分裂症RNA-seqstudy.logicalnetworks(6).Geneco-expressionnetworksareco-expression初步的demonstratethatthenetworksconstructedbycanonicaloftenusedtoextractimportantinformationcorrelationanalysisandRNA-seqdataproviderichgroupsofco-regulated提取about
genesthatplayacentralrolein
geneticandmolecularinformationtogaininsightregulatoryprocesses.Co-expressionnetworksareabletointobiologicalprocessesanddiseasemechanism.洞察comprehensivelycapturetherelationshipsofindividualOurnewmethodssubstantiallyoutperformcomponentsofthe获transcriptome得perturbedbyenviron-currentstatisticalmethods大量的forco-expression胜过thements(7);hence,theyprovideapowerful扰乱tooltogainnetworkconstructionwithmicroarrayexpressionnewinsightsintothefunctionofgenes,biologicaldataorRNA-seqdatabasedonoverallgeneexpres-processes,theglobalstructureofthetranscriptomeand
mechanismofcomplexdiseases(6,8–11).
sionlevels.Traditionalstatisticalmethodsforconstructionof
co-expressionnetworks,suchasweightedco-expression
INTRODUCTIONnetworks,mutualinformationrelevance
anceselection共同andsparsegraphical关联networks,covari-
model,and共变异
Despitegreatprogressingeneticstudiesofcomplexcorrelationmethodsaremainly图解designedformicroarray局partial部的diseaseshasbeenmade,informationonthefunctionofexpressiondata(12–15).Allthesemethodsuseasingle*Towhomcorrespondenceshouldbeaddressed.Tel:+17135009894;Fax:+17135000900;Email:momiao.xiong@http://wendang.chazidian.com
ßTheAuthor(s)2013.PublishedbyOxfordUniversityPress.
ThisisanOpenAccessarticledistributedunderthetermsoftheCreativeCommonsAttributionNon-CommercialLicense(http://wendang.chazidian.com/licenses/by-nc/3.0/),whichpermitsunrestrictednon-commercialuse,distribution,andreproductioninanymedium,providedtheoriginalworkisproperlycited.Downloaded from http://wendang.chazidian.com/ at Harbin Medical Uniersity on May 1, 2013
RNA-SEQ
e95NucleicAcidsResearch,2013,Vol.41,No.8
valueofsummarizingstatistictorepresentgeneexpressionlevelandoverlookallinformationonexpressiondiffer-enceinexons,genomicpositionandalleles.Therefore,althoughRNA-seqdramaticallyincreasesthelevelofbio-logicaldetails(11),westillusethetraditionalstatisticalmethodsforco-expressionnetworkinference,whicharedesignedformicroarrayexpressiondataanddonotef?-cientlyusealloftheinformationcontainedinRNA-seqdata.Tofullyusethecomprehensiveinformationofthetranscriptomeandcaptureexpressionvariationatthelevelofexon,chromosomalposition,alleleandsplicingisoformswhichareprovidedbyRNA-seq,developmentofpowerfulcomputationaltoolsforexpressiondataanalysisishighlydesirable(16).
Inthisarticle,wedevelopcomputationalmethodstoaddresschallengesarisingfromco-expressionnetworkin-ferencewithRNA-seqdata.Toexploreobservedexpres-sionvariationinexonsoringenomicpositionacrossthegenes,weuseanationanalysis(CCA)普ordinary通的
singlevariatecanonicalcorrel-thatquanti?esthebetweenalinearcombinationof量the化
correlation
expressionsatexonlevelsorpositionlevelsinonegeneandanothersuchcom-binationofexpressionsinasecondgenetoconstructco-expressionnetworks.Speci?cally,theexpressionlevelateachexon,orexpressionlevelateachgenomicposition,willbeconsideredasvariables.Theexonexpressionsorgenomicpositional-levelexpressionsoftwogenesformtwolargesetsofvariables.Wewishtostudythoselinearcombinationsofvariablesmosthighlycorrelated.ThegoalofCCAistoseeklinearcombinationsoftwosetsofvariablesthatmaximizethecorrelationbetweentwosetsofvariables.Toachievethis,we?rstidentifythepairoflinearcombinationsthathavethelargestcorrel-ation.Next,weidentifythepairoflinearcombinationshavingthelargestcorrelationamongallpairsuncorrelatedwiththeinitiallyselectedpair,andsoon.Therefore,CCAmeasurestheco-expressionbetweentwogenesthatcantakegenomicpositionandallelelevelsofexpressionsintoconsideration.TomodelASE,wedevelopbivariateCCAtoconstructco-expressionnetworkswithASEdata,allowinglevelsofASEtovaryacrossSNPsandtoconsidercomplicatedpatternsofASEbecauseofallele-speci?csplicingandalternativetranscriptionstartsites(2).BivariateCCAconsidertwosetsofvectorsofmeasurements.Twoallele-speci?cexpressionsateachSNPformavectorfortheSNP.BivariateCCAistoseekafewlinearcombinationsofvectorswithtwoalleles’expressionsthathavethelargestcorrelations.Therefore,twovariateCCAmeasuretheco-expressionsbetweentwogenesthatcanconsiderexpressionsoftwoallelesateachSNP.ToevaluatetheperformanceofCCAforco-expressionnetworkinferencewithRNA-seqdata,theCCAforco-expressionnetworkconstructionisappliedtolungsquamouscellcancer(LUSC)expressiondatafromTCGAandabipolardisorderandschizophre-niaRNA-seqstudy.We?ndthatCCAforco-expressionnetworkconstructionwithRNA-seqdatasubstantially
outperformsthecurrentstatisticalmethods大量的
co-expression胜过
fornetworkconstructionwithmicroarrayexpressiondataoroverallgeneexpressiondata.Aprogramforimplementing实施
thedevelopedCCAfor
PAGE2OF15
co-expressionnetworkconstructioncanbedownloadedfrombioconductor(http://wendang.chazidian.com/)andourlocalwebsitehttp://www.sph.uth.tmc.edu/hgc/faculty/xiong/index.htm.MATERIALSANDMETHODSDataaccess
TheTCGARNA-seqdatasetsarepubliclyavailablefromtheTCGAwebsite(https://tcga-data.nci.nih.gov/tcga/).PathwaysareavailablefromKEGGdatabase(17,18)(http://www.genome.jp/kegg/http://wendang.chazidian.comAmethodforconstructionofgeneco-expressionnetworks
Ageneco-expressionnetworkisconsideredasanundir-ectedgraph,whereageneisrepresentedasanodeandeachedgeconnectingtwonodesisregardedastheco-expressionrelationshipofthetwoconnectedgenes.Constructionofco-expressionnetworksisoftencarriedoutbydetectingthepairwisecorrelationofgeneco-expression.TheCCAistoseekmaximizationofthecorrelationbetweentwolinearcombinationofthevariablesinthedatasets(19).Supposethatwehavepexonsorpositionsinonegeneandqexonsorpositions
inanothergene.Letxð1Þ
jdenotetheexpressionofthej-thexonorthenumberofreadsatthej-thgenomicposition
withinthe?rstgene.Wecansimilarlyde?nexð2Þ
forthesecondgene.LetXð1Þ¼½xð1Þ1,...,xð1ÞT
andj
Xð2Þ¼½xð2Þð2ÞT
p??
we1,...,xassumethatq??.Fortheconvenienceofpresenta-tion,p q.Let
X¼??Xð1Þ ??Æ
Xð2ÞandƼcovðX,XÞ¼Æ1112
Æ:21Æ22Constructionofco-expressionnetworksisimplementedbyseekingmaximizationofcorrelationÞcoef?cientsbetweenlinearcombinationU¼aTXð1forthe?rstgeneandlinearcombinationV¼bTXð2Þ:maxa,bcorrðU,VÞ¼aTÆ12b
aTÆ11abTÆ22b
ð1Þ
Solutionstotheoptimizationproblemeigenvalues??21!??2the2!ÁÁÁ!??2
(1)arethe
oftheRayleighpandtheircorrespondingeigenvectorsofquotientmatrix:
R¼ÆÀ1=2À1
À1=211Æ12Æ22Æ21Æ11:
InRNA-seqdata,weobserveeithermultipleexon
expressionsorsetsofnumberofreadsatgenomicpositionlevelsacrosstwogenes.Theexon-levelorgenomicpositional-levelexpressionsformtwosetsofvari-ablesortwovectorsofvariables.Canonicalcorrelationbetweentwogenesisto?ndthepairoflinearcombin-ationsofthevariablesdeterminedbyaandbsuchthattheircorrelationismaximized.The?rstpairoflinearcombinationiscalledthe?rstpairofcanonicalvariables.Theirlargestcorrelationiscalledthe?rstcanonicalcor-relations.Next,weidentifythepairoflinearcombinations
Downloaded from http://wendang.chazidian.com/ at Harbin Medical Uniersity on May 1, 2013
RNA-SEQ
PAGE3OF15
thathavethelargestcorrelationamongallpairsuncorrelatedwiththeinitiallyselectedpairandcalledthesecondpairofcanonicalvariables,andsoon.The?rstcanonicalcorrelationisequaltothesquarerootofthelargesteigenvalue??1ofthematrixR,thesecondcanonicalcorrelationisequaltothesquarerootofthelargesteigenvalue??2ofthematrixR,andsoon.LetebetheeigenvectorofthematrixRassociatedwiththejeigenvalue??j.Then,thevectorsofcoef?cientsaandbaregivenby
aj¼ÆÀ1
ÆÀ1=2À1=211Æ1222ej,andbj¼Æ22ej,j¼1,...,p:
Ordinarycorrelationcoef?cientcanonlymeasurethelinearrelationshipbetweentwovariables.Whenitisappliedtoquantifyco-expressionbetweentwogenes,exon-levelexpressionsorgenomicpositional-levelexpres-sionsneedtobeaggregatedintooverallexpressions.Theexon-levelandgenomicpositional-levelexpressionvariationinformationcannotbepreservedintheordinarycorrelation.Canonicalcorrelationisextensionofordinarycorrelationbetweentwovariablestocanonicalcorrelationbetweentwosetsofvariables.Therefore,canonicalcorrelationformeasuringco-expressionbetweentwogenesinRNA-seqdatacanmoreaccuratelyquantifythelinearrelationshipbetweentwogenesthantheordinarycorrelation.
LetPkbetheP-valuepoftheteststatistic(20)
Tk¼À½nÀðp+qÞ??P
logð1À??^2i¼k+1
iÞwithdistribution
2ðpÀkÞðqÀkÞ,wherenissamplesizefortestingthenull
hypothesisH0:??k¼...¼??p¼0.Weassignaweighttotheedgeconnectingtwogenes:
P
p??iIðlogPiÞ
w¼i¼1ð2Þ
IðlogPiÞ
i¼1
whereIðlogPÞ¼
??
0P>0:05
ÀlogPP 0:05
.Whenthedenom-inatoriszero,theweightiszero.
ThemethodfordeterminingthethresholdforretaininganedgebytheCCAmethodishardthresholdmethod(8).We?rstrankededgesbytheirweightsfromthelargesttothesmallest.Wethenselectedgesbypre-determinednumberofedgesorpercentageofedges.Inthisarticle,weselected5%ofedgeswithtopweights.
Theedgeswithranklargerthanthresholdareretainedinthenetwork.
Ingeneral,wehavemultiplecanonicalvariablesandcanonicalcorrelations.Tofullyusecanonicalcorrelationstocharacterizetherelationshipsbetweentwosetsofvariables,weintroducetheweightw.Thelargerthecontributiontotheedgeweight,thelargertheeigenvalueorcanonicalcorrelation.Ifthei-thcanonicalcorrelationisnotsigni?cant(P>0.05),itscontributiontotheedgeweightwillbesmall.Therefore,IðlogðpÞÞisgivenvalueof0.Theedgeweightde?nedbyEquation(2)canfullyusecanonicalcorrelationinformationtomeasurethedegreesofco-expressionoftwogenes.NucleicAcidsResearch,2013,Vol.41,No.8e95
BivariateCCAforconstructionofco-expressionnetworks
withASEdata
WedevelopnovelbivariateCCAforconstructionof
co-expressionnetworkswithASEdata.Letxð1Þjandxð2Þj
bethenumberofreadsofthemajorandminoralleleatthej-theSNPinthegene,respectively.Wecansimilarly
de?neyð1Þð2Þð1Þð2Þ
jandyjforanothergene.LetX¼½x1,x1,...,
xðp1Þ,xðð1Þð2Þp2Þ??TandY¼½y1,y1,...,yðq1Þ,yðq2Þ??T
.De?nelinear
combinationsU¼aTXandV¼bTY,wherea¼½??ð1Þ
1,??ð2Þ1,...,??ðp1Þ,??ðp2Þ??Tandb¼½??ð1Þð2Þ1,??1,...,??ðq1Þ,??ðq2Þ??T.Theselinearcombinationscanberewrittenas
U¼½??ð1Þ??TXð1Þ+½??ð2Þ??TXð2ÞandV¼½??ð1Þ??TYð1Þ+½??ð2Þ??TYð2Þ,
where??ð1Þ¼½??ð1Þ1,...,??ðp1Þ??T,??ð2Þ¼½??ð2Þ1,...,??ðp2Þ??T,??ð1Þ
¼½??ð1Þ,...,??ðq1Þ??T,??ð2Þ¼½??ð2Þ1,...,??ðq2Þ??T,Xð1Þ¼½xð1Þ11,...,
xðp1Þ??TXð2Þ¼½xð2Þ1,...xðp2Þ??T,Yð1Þ¼½yð1Þ1,...,yð1ÞT
andYð2Þ¼½yð2Þq??
1,...,yð2ÞT
De?nethecovarianceq??.
matrices:
Ƽ??Æ ÆxxÆ
Æxyyx
yy
,where
Æ??
Æxð1Þxð1ÞÆxð1Þxð2Þ
xx¼Æ,
xð2ÞxðÆ??1ÞÆxð2Þxð2Þ
Æð1Þð1ÞÆð1Þð2Þ
xy¼ÆTxyxy
yx¼ÆandðÆ??x2Þy
ð1Þ
Æxð2Þyð2ÞÆyð1Þyð1ÞÆyð1Þyð2Þ
yy¼
Æ:yð2Þyð1ÞÆyð2Þyð2Þ
TheCCAseekstomaximize
max??,??corrðU,VÞ¼??TÆxy??
ð3where??¼????TÆxx????TÆyy??
Þ??ð1Þ ????
ð1Þ ??ð2Þand??¼??ð2Þ
.Thesolutionstotheoptimizationproblem(3)aretheeigenvectorsofthematrixwiththeeigenvalues??1!??2!...!??2p:
R¼ÆÀ1=2À1À1=2
xxÆxyÆyyÆyxÆxx:
Ourformulationconsidersthecorrelationbetweenthe
expressionsoftwoalleles.Ifwedonottaketheircorrel-ationsintoaccount,thetwovariateCCAwillbecometwosinglevariateCCA.Again,letPTkbek¼À½nÀðp+qÞ??Pthe2pP-valueofthei¼k+1logð1À??^teststatistic(20)
2iÞ
withdistribution 2ð2pÀkÞð2qÀkÞ,wherenissamplesizefortestingthenullhypothesisH0:??k¼...¼??2p¼0.Weassignaweighttotheedgeconnectingtwogenes:
P
2p??iIðlogPiÞ
w¼
i¼1P2pð4Þ
IðlogPiÞ
i¼1
Downloaded from http://wendang.chazidian.com/ at Harbin Medical Uniersity on May 1, 2013
RNA-SEQ
e95NucleicAcidsResearch,2013,Vol.41,No.8PAGE4OF15whereIðlogPÞ¼??0P>0:05
ÀlogPP 0:05.Whenthedenom-SupplementaryFigureS1.Itwasclearlyshownthatthe
inatoriszero,theweightiszero.accuracyofCCAmethodforconstructionofco-SimilartosingleCCAforconstructionofco-expressionexpressionnetworkswasmuchhigherthanthatbynetworks,afterweranktheweights,wealsouserankGLASSOunderalldifferentnetworksizes.Thenweproceduretoprunethenetworks.studiednon-smallcelllungcancerpathwayinKEGG
withLUSCdataset.Afterdiscardingtheisolated
GraphicalLASSOinthepathwayandmatching抛弃nodes
totheTCGALUSC
RNA-seqlevel3data,weincluded44genesinthe
Sparseundirectedgraphicalmodelscanbeestimatedbyanalysis.Theconstructedco-expressionnetworksforthetheuseofL1(LASSO:leastabsoluteshrinkageandnon-smallcelllungcancerpathwaybytheCCAandselectionoperator)regularization(21).WeassumethatGLASSO(21)methodwereshowninFigure1.Edgestheoverallexpressionsofgeneshaveamultivariatewithredcolourwereintheco-expressionnetworkcon-Gaussiandistributionwithmean??andcovariancestructedonlybyCCA.EdgeswithbluecolourwereinmatrixÆ.Itisshowntheco-expressionnetworksconstructedbybothCCAinversematrixÆÀ1thatiftheijthecomponentoftheiszero,thenvariablesiandjareandGLASSO.Edgeswith蓝cyancolourwereintheconditionallyindependent,giventheothervariables.co-expressionnetworksconstructed绿色onlybyGLASSO.Therefore,co-expressionnetworkscanbeconstructedbyItconsistedoffourpathways:ErbBsignallingpathway,estimatingtheinverseofcovariancematrixÆÀ1throughMAPKsignallingpathway,PI3KpathwayandL细apoptosis胞1regularization.pathway.Figure1showedthatEGF(epidermalgrowth凋亡
factor)–EGFR–PI3K/Akt–apoptosissignal表皮pathwayand
MAPK(Raf–MEK–ERK)signalpathwaywereinthe
RESULTSco-expressionnetworkconstructedbytheCCAmethod.CanonicalcorrelationanalysisforconstructionofHowever,EGF–EGFR–PI3KpathwayandMAPKco-expressionnetworkswithexon-levelexpressiondatapathwayconnectionswerenotintheco-expressionAgeneco-expressionnetworkisrepresentedasanundir-networkconstructedbyGLASSO.Recentstudiesun-ectedgraph,whereeachnodedenotes未受指导的covered发现(22)thattheEGFstimulates刺激theproductionofpreciselyageneexpressionpro?le,表ageneormore
and示eachedgeconnect-白interleukin(IL)-8fromlungcancercells,whichinturn
ing精确two的nodesindicatessigni?cantco-expressionrelation-activates细胞介素EGFRandsignallingpathwayofPI3K/Akt.shipsofthetwogenes.ToexploreexonswithvaryingPI3K/Aktpathwayactivationplaysa重crucial要的roleinlungexpressioninformation,wedevelopedaCCA-basedcancerdevelopmentandproliferation.
wasinvolvedingenetranscription,增殖Raf–MEK–ERK
regulationofcellmethodforconstructionofco-expressionnetworks.Thesurvivalandangiogenesisandwasassociatedwithlungcanonicalcorrelationsmeasurethestrengthofassociationmetastasis(23).血管生成
betweenthetwosetsofexonexpression.ToillustratetheNetwork转移topologyplaysanimportantroleintheapplicationofCCAforconstructionofco-expressionfunctionandinformationprocessingofbiologicalnetworkswithexonexpressiondata,theCCAwasnetworks(24).AssortativityandcentralizationaretwoappliedtoLUSCRNA-seqdatasetfromTCGAimportanttopology协measures调集中化database.LUSCconsistsof242samples(225caseisapreference拓扑for学ofnetworks.Assortativityanetwork’snodestoattachtosamplesand17controlsamples).Here,weonlyusedatasimilar倾nodes.向other
Assortativityismeasuredbytheassort-belongingtothecasegroup.TheCCAmethodwasativitycoef?cientthatisde?nedasthePearsoncorrelationcomparedwithagraphicLASSO(GLASSO)(21)coef?cientofdegreebetweenpairsoflinkednodes(25).methodthatusedoverallgeneexpressiontoconstructIftheassortativitycoef?cientwaspositive,thenetworkco-expressionnetwork(fordetail,see‘Materialsandwassaidtobeassortative.Ontheotherhand,iftheMethods’section).Asimplerank-basedcut-offmethodassortativitycoef?cientwasnegative,thenetworkwaswasusedtoprunethenetwork(fordetail,see‘Materialsrecognizedasdisassortative.WeobservethatinandMethods’删section).减networks,highly异connected配的social
nodestendtobeconnectedWe?rstrandomlyselectedTheTCGARNA-seqlevel3withotherhighdegreenodes.Theassortativitycoef?cientdatawith50,200,400,600and1000genes,resultingininsocialnetworksis,hence,positive(25).However,?veRNA-seqdatasets.WethenappliedGLASSOandtechnologicalandbiologicalnetworkstypicallyshowCCAtothese?vesampledRNA-seqdatasetstoconstructthathigh-degreenodestendtoattachtolow-degreeco-expressionnetworksthatwerecalledtheoriginalnodes.Theirassortativitycoef?cientisnegative.For
co-expressionnetworksasthebasistoevaluatetheper-randomnetworks,theassortativitycoef?cienttendedtoformanceofCCAandGLASSOforconstructionofbenearlyzero(Wikipedia,thefreeencyclopedia).Thecen-co-expressionnetworks.Weusedbootstrapping
re-samplingRNA-seqdata1000timesfrom引导程fortralityofavertexeach序of?verelativeimportance顶点withinagraphthatdeterminesthe
ofavertexwithinthegraphisan
RNA-seqdatasets.TheCCAandGLASSOmethodsimportantconceptinnetworktheory.Centralizationiswereappliedtothere-sampledRNA-seqdatatoconstructbasedonthe概念conceptofcentrality.Itisde?nedasco-expressionnetworksfortestingtheaccuracy(howmeasuringthesumindifferencesincentralitybetweenmanyedgesintheoriginalnetworksarereservedinthethemostcentralnodeinanetworkandallothernodes.reconstructedco-expressionnetworksfrom保the留re-sampledItattemptstoquantifythelevelofanetworkabouthowRNA-seqdatasets).Theresultswereshownincentralizeditwasaroundparticularnodes(26).Table1Downloaded from http://wendang.chazidian.com/ at Harbin Medical Uniersity on May 1, 2013
RNA-SEQ
PAGE5OF15NucleicAcidsResearch,2013,Vol.41,No.8e95Figure1.Thesharednetworkstructurebynon-smalllungcancerpathwayinKEGGandreconstructedco-expressionnetworksusingtheCCAandGLASSOmethods.
Table1.Topologypropertyofco-expressionnetworksgeneratedbyexpressionswasmorebiologicallymeaningfulthanthatbyCCA,GLASSO,randomandKEGGoverallgeneexpressions.
MethodAssortativityCentralizationTofurtherevaluatetheperformanceoftheCCAfor
Mean(SD)Mean(SD)constructionofco-expressionnetworks,weappliedboth
CCAandGLASSOmethodstoanotherTCGARNA-seq
CCAÀ0.3937(0.0407)0.7606(0.0078)
GLASSOÀ0.0482(0.0868)0.6297(0.0458)dataset(uterinecorpusendometrioidcarcinoma)where
RandomÀ0.0489(0.0064)0.5666(0.0033)416casesamplesofUCECwereusedinthestudy.KEGGÀ0.23050.7257Thereconstructedpathwayfortheendometrial
bytheCCAandGLASSOmethod子宫were内膜cancer
shownin
SupplementaryFigureS2.Fromthis?gure,itwas
showedthatassortativityandcentralizationofshownthatonlytheCCAmethoddetectedtheMAPKco-expressionnetworksconstructedbytheCCA,(Raf–MEK–ERK)signalpathwayintheco-expressionGLASSO,randomselectionandtruenon-smallcelllungnetwork.Wealsopresentthenetworktopologycompari-cancerpathwayinKEGG,wherestandarddeviationsonoftworeconstructednetworkaswellasrandomcalculatedby1000re-sampling.Weobserved偏差was
thatnetworkandtrue
topologyoftheco-expressionnetworkconstructedbytheSupplementary子endometrioid
Table宫内膜carcinomapathwayin
S1.样的Thistableshowedthatthe
GLASSO(overallgeneexpression)wasclosetotheco-expressednetworkreconstructedbytheCCAmethodrandomnetwork;however,topologyoftheco-expressionwasclosertothepathwayinKEGG,butthetopologyofnetworkconstructedwasclosetothestructureoftrueco-expressionnetworkinferredbytheoverallexpressionsnon-smallcelllungcancerpathwayinKEGG.Table1andGLASSOwasmoresimilartotherandomone.indicatedthattheco-expressionnetworksconstructedby
RNA-seqexonexpressionswerehighlycentralizedandCCAforco-expressionnetworkconstructionwithdisassortative,whichwereinherenttopologyposition-levelRNA-seqdata
thebiologicalnetworks.This固有的featuresinfurtherdemonstratedthatThecurrentmethodsforco-expressionnetworkconstruc-theco-expressionnetworkconstructedbyRNA-seqexontionwithRNA-seqdataareto崩collapse溃倒塌the
内容需要下载文档才能查看position-levelDownloaded from http://wendang.chazidian.com/ at Harbin Medical Uniersity on May 1, 2013
下载文档
热门试卷
- 2016年四川省内江市中考化学试卷
- 广西钦州市高新区2017届高三11月月考政治试卷
- 浙江省湖州市2016-2017学年高一上学期期中考试政治试卷
- 浙江省湖州市2016-2017学年高二上学期期中考试政治试卷
- 辽宁省铁岭市协作体2017届高三上学期第三次联考政治试卷
- 广西钦州市钦州港区2016-2017学年高二11月月考政治试卷
- 广西钦州市钦州港区2017届高三11月月考政治试卷
- 广西钦州市钦州港区2016-2017学年高一11月月考政治试卷
- 广西钦州市高新区2016-2017学年高二11月月考政治试卷
- 广西钦州市高新区2016-2017学年高一11月月考政治试卷
- 山东省滨州市三校2017届第一学期阶段测试初三英语试题
- 四川省成都七中2017届高三一诊模拟考试文科综合试卷
- 2017届普通高等学校招生全国统一考试模拟试题(附答案)
- 重庆市永川中学高2017级上期12月月考语文试题
- 江西宜春三中2017届高三第一学期第二次月考文科综合试题
- 内蒙古赤峰二中2017届高三上学期第三次月考英语试题
- 2017年六年级(上)数学期末考试卷
- 2017人教版小学英语三年级上期末笔试题
- 江苏省常州西藏民族中学2016-2017学年九年级思想品德第一学期第二次阶段测试试卷
- 重庆市九龙坡区七校2016-2017学年上期八年级素质测查(二)语文学科试题卷
- 江苏省无锡市钱桥中学2016年12月八年级语文阶段性测试卷
- 江苏省无锡市钱桥中学2016-2017学年七年级英语12月阶段检测试卷
- 山东省邹城市第八中学2016-2017学年八年级12月物理第4章试题(无答案)
- 【人教版】河北省2015-2016学年度九年级上期末语文试题卷(附答案)
- 四川省简阳市阳安中学2016年12月高二月考英语试卷
- 四川省成都龙泉中学高三上学期2016年12月月考试题文科综合能力测试
- 安徽省滁州中学2016—2017学年度第一学期12月月考高三英语试卷
- 山东省武城县第二中学2016.12高一年级上学期第二次月考历史试题(必修一第四、五单元)
- 福建省四地六校联考2016-2017学年上学期第三次月考高三化学试卷
- 甘肃省武威第二十三中学2016—2017学年度八年级第一学期12月月考生物试卷
网友关注
网友关注视频
- 【部编】人教版语文七年级下册《老山界》优质课教学视频+PPT课件+教案,安徽省
- 苏科版数学七年级下册7.2《探索平行线的性质》
- 化学九年级下册全册同步 人教版 第22集 酸和碱的中和反应(一)
- 冀教版小学数学二年级下册第二单元《租船问题》
- 19 爱护鸟类_第一课时(二等奖)(桂美版二年级下册)_T3763925
- 沪教版八年级下册数学练习册20.4(2)一次函数的应用2P8
- 8.练习八_第一课时(特等奖)(苏教版三年级上册)_T142692
- 苏科版八年级数学下册7.2《统计图的选用》
- 第19课 我喜欢的鸟_第一课时(二等奖)(人美杨永善版二年级下册)_T644386
- 每天日常投篮练习第一天森哥打卡上脚 Nike PG 2 如何调整运球跳投手感?
- 化学九年级下册全册同步 人教版 第18集 常见的酸和碱(二)
- 北师大版数学四年级下册第三单元第四节街心广场
- 沪教版八年级下册数学练习册21.3(3)分式方程P17
- 冀教版小学英语四年级下册Lesson2授课视频
- 六年级英语下册上海牛津版教材讲解 U1单词
- 二年级下册数学第一课
- 外研版八年级英语下学期 Module3
- 3.2 数学二年级下册第二单元 表内除法(一)整理和复习 李菲菲
- 北师大版八年级物理下册 第六章 常见的光学仪器(二)探究凸透镜成像的规律
- 【部编】人教版语文七年级下册《泊秦淮》优质课教学视频+PPT课件+教案,辽宁省
- 二年级下册数学第三课 搭一搭⚖⚖
- 【部编】人教版语文七年级下册《过松源晨炊漆公店(其五)》优质课教学视频+PPT课件+教案,江苏省
- 沪教版牛津小学英语(深圳用)五年级下册 Unit 1
- 河南省名校课堂七年级下册英语第一课(2020年2月10日)
- 沪教版牛津小学英语(深圳用) 四年级下册 Unit 4
- 人教版历史八年级下册第一课《中华人民共和国成立》
- 第五单元 民族艺术的瑰宝_16. 形形色色的民族乐器_第一课时(岭南版六年级上册)_T3751175
- 第4章 幂函数、指数函数和对数函数(下)_六 指数方程和对数方程_4.7 简单的指数方程_第一课时(沪教版高一下册)_T1566237
- 二年级下册数学第二课
- 沪教版牛津小学英语(深圳用) 四年级下册 Unit 8
精品推荐
- 2016-2017学年高一语文人教版必修一+模块学业水平检测试题(含答案)
- 广西钦州市高新区2017届高三11月月考政治试卷
- 浙江省湖州市2016-2017学年高一上学期期中考试政治试卷
- 浙江省湖州市2016-2017学年高二上学期期中考试政治试卷
- 辽宁省铁岭市协作体2017届高三上学期第三次联考政治试卷
- 广西钦州市钦州港区2016-2017学年高二11月月考政治试卷
- 广西钦州市钦州港区2017届高三11月月考政治试卷
- 广西钦州市钦州港区2016-2017学年高一11月月考政治试卷
- 广西钦州市高新区2016-2017学年高二11月月考政治试卷
- 广西钦州市高新区2016-2017学年高一11月月考政治试卷
分类导航
- 互联网
- 电脑基础知识
- 计算机软件及应用
- 计算机硬件及网络
- 计算机应用/办公自动化
- .NET
- 数据结构与算法
- Java
- SEO
- C/C++资料
- linux/Unix相关
- 手机开发
- UML理论/建模
- 并行计算/云计算
- 嵌入式开发
- windows相关
- 软件工程
- 管理信息系统
- 开发文档
- 图形图像
- 网络与通信
- 网络信息安全
- 电子支付
- Labview
- matlab
- 网络资源
- Python
- Delphi/Perl
- 评测
- Flash/Flex
- CSS/Script
- 计算机原理
- PHP资料
- 数据挖掘与模式识别
- Web服务
- 数据库
- Visual Basic
- 电子商务
- 服务器
- 搜索引擎优化
- 存储
- 架构
- 行业软件
- 人工智能
- 计算机辅助设计
- 多媒体
- 软件测试
- 计算机硬件与维护
- 网站策划/UE
- 网页设计/UI
- 网吧管理