Supplementary Materials

SmartR: An open-source platform for interactive visual analytics for translational research data

Sascha Herzinger, Wei Gu, Venkata Satagopam, Serge Eifes, Kavita Rege, Adriano Barbosa Da Silva, Reinhard Schneider

Here we provide a few example implementations of visual analytics using the SmartR framework. All tools shown in the figures can be accessed via the public server at http://smartR.lcsb.uni.lu

1) Dynamic Heat Map: For a detailed list of the features of this workflow, please refer to the associated paper. This figure is included for completeness and includes a link to a video that demonstrates the dynamic nature of the heat map.

SFig. 1 Dynamic heat map. The heat map shown is fully sortable and contains many interactive elements that are listed in detail in the associated paper. Video URL: https://youtu.be/kLRSOMbKuns
Supplementary Materials SmartR: An open-source platform for …orbilu.uni.lu/bitstream/10993/33786/2/btx137_Supp.pdf · 2017-12-22 · Microsoft Word - btx137_Supp.docx Created Date:

Oct 21, 2018



2) Correlation Analysis: This workflow consists of a scatter plot, histograms for the respective axes, and a legend with statistics, such as correlation coefficient and p-value. The scatter plot allows user interaction by area selection, which triggers the re-computation of the correlation statistics, the regression line position, and the histograms. The context menu, available via right-click, reveals the options to zoom, reset or exclude the selected area.

SFig. 2 Correlation Analysis. Shown is a scatter plot with histograms for the respective axes. Statistics and other plot elements adapt dynamically to certain user-triggered events. Video URL: https://youtu.be/IP7vBSQsaFA
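The re-computation triggered by an area selection can be sketched as follows. This is a minimal illustration and not the SmartR implementation: the function names, the rectangular selection and the sample points are assumptions, the correlation shown is Pearson's r, and the p-value is left out because it needs a t-distribution that the Python standard library does not provide.

```python
from math import sqrt

def pearson_and_fit(points):
    """Return (r, slope, intercept) for a list of (x, y) pairs."""
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    syy = sum((y - mean_y) ** 2 for _, y in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for constant data")
    r = sxy / sqrt(sxx * syy)          # Pearson correlation coefficient
    slope = sxy / sxx                  # least-squares regression line
    intercept = mean_y - slope * mean_x
    return r, slope, intercept

def select_area(points, x_min, x_max, y_min, y_max):
    """Keep only the points inside the (hypothetical) rectangular selection."""
    return [(x, y) for x, y in points
            if x_min <= x <= x_max and y_min <= y <= y_max]

# Hypothetical data: four points on a line plus one outlier.
points = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0), (10.0, 1.0)]
r_all, slope_all, intercept_all = pearson_and_fit(points)

# Selecting an area that leaves out the outlier triggers a re-computation.
selected = select_area(points, 0.0, 5.0, 0.0, 10.0)
r_sel, slope_sel, intercept_sel = pearson_and_fit(selected)
```

With the outlier excluded, the statistics are computed on the remaining four points only, which is the behaviour the selection and exclude options describe.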

3) Line Graph: The main purpose of the line graph is to visualize longitudinal data. Main features are the manually sortable x-axis, which is important in case the data are not time-annotated (unordered), hover events that highlight certain data or reveal detailed information (see SFig. 3), and the possibility to switch instantly between several display modes (e.g. mean with SEM error bars, median with SD error bars, or raw data).

SFig. 3 Line graph. Shown is a plot that visualizes machine-generated time series data. Besides many visual helpers that are triggered by mouse-over events, this visualization has a manually sortable x-axis and different methods for defining the shown timelines. Video URL: https://youtu.be/GFCmJysbTyM
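The display modes and the manually ordered x-axis can be illustrated with a short sketch. The mode names, function names and sample values are hypothetical, and the sample standard deviation is assumed for both SD and SEM. The raw-data mode would simply plot the values unchanged and is omitted.

```python
from math import sqrt
from statistics import mean, median, stdev

def summarize(values, mode):
    """Return (center, error) for the values measured at one x-axis position.

    Needs at least two values, because the sample SD is undefined otherwise.
    """
    if mode == "mean_sem":
        return mean(values), stdev(values) / sqrt(len(values))
    if mode == "median_sd":
        return median(values), stdev(values)
    raise ValueError(f"unknown display mode: {mode}")

def line_points(series, order, mode):
    """Summarize each x-axis position in the order chosen by the user."""
    return [(label, *summarize(series[label], mode)) for label in order]

# Hypothetical, unordered (not time-annotated) measurements.
series = {"visit A": [4.0, 6.0, 8.0], "visit B": [5.0, 7.0, 9.0, 11.0]}

# The user drags "visit B" in front of "visit A" and picks mean with SEM.
points = line_points(series, ["visit B", "visit A"], "mean_sem")
```

Switching the display mode only changes the `mode` argument, so the same ordered positions can be redrawn without touching the underlying data.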

4) Boxplots: This workflow features boxplots with a diamond-like shape that represents the mean with standard deviation. Besides the option to log-transform the data before visualization, single rows of microarray data can be selected and treated as normal numerical input for this workflow.

SFig. 4 Boxplots. Currently this is the only visualization that uses Plotly (Plotly Technologies Inc. Collaborative data science. Montréal, QC, 2015. https://plot.ly) as a visualization library. It is included for completeness and to demonstrate that the framework is not limited to certain visualization libraries.
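The quantities drawn for one box can be sketched as below. This is an illustration under assumptions: the source does not state the logarithm base or the quartile method, so base 2 and the "inclusive" method of Python's `statistics.quantiles` are used here, and the function name and sample values are hypothetical.

```python
from math import log2
from statistics import mean, quantiles, stdev

def box_stats(values, log_transform=False):
    """Return quartiles plus mean and sample SD for one box."""
    if log_transform:
        # Assumed base-2 transform; requires strictly positive values.
        values = [log2(v) for v in values]
    q1, q2, q3 = quantiles(values, n=4, method="inclusive")
    return {
        "q1": q1,
        "median": q2,
        "q3": q3,
        "mean": mean(values),   # center of the diamond
        "sd": stdev(values),    # extent of the diamond
    }

# Hypothetical expression values of a single microarray row.
stats = box_stats([1.0, 2.0, 4.0, 8.0, 16.0], log_transform=True)
```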

5) Volcano Plot: This workflow helps to identify microarray features that have both a high log-fold change and a significant p-value. Besides a tooltip that shows the statistics for each data point, there is a table that lists all data points with an absolute log-fold change greater than 0.5 and a p-value lower than 0.05. These values are represented by limiters in the scatter plot, which are drag-enabled and instantly update the table on change. Similar to the dendrogram feature in the SmartR heat map, we can launch a KEGG pathway enrichment analysis for genes of interest.

SFig. 5 Volcano plot. Shown is the widely known volcano plot that has been enhanced by some dynamic elements. The limiters are drag-enabled and trigger an update for the right-hand table displaying the most significant genes. Video URL: https://youtu.be/OxEgDs8vebY
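The table filter behind the two limiters can be sketched as follows. The thresholds 0.5 and 0.05 come from the text above; the function name, the record layout, the feature names and the ordering by p-value are assumptions made for illustration.

```python
def significant_features(features, min_abs_logfc=0.5, max_p=0.05):
    """Return the features beyond both limiters, most significant first."""
    hits = [f for f in features
            if abs(f["logfc"]) > min_abs_logfc and f["p"] < max_p]
    return sorted(hits, key=lambda f: f["p"])

# Hypothetical features with log-fold change and p-value.
features = [
    {"name": "FEATURE_A", "logfc": 1.2, "p": 0.001},
    {"name": "FEATURE_B", "logfc": 0.3, "p": 0.0001},   # fold change too small
    {"name": "FEATURE_C", "logfc": -0.8, "p": 0.03},
    {"name": "FEATURE_D", "logfc": 2.0, "p": 0.2},      # not significant
]

default_table = significant_features(features)

# Dragging the fold-change limiter to 1.0 re-runs the same filter.
stricter_table = significant_features(features, min_abs_logfc=1.0)
```

Because both limiters map to plain function arguments, each drag event only needs to call the filter again with the new values to refresh the table.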

Show Case: Guided Analysis of Parkinson Disease (PD) dataset

In the following, we will demonstrate the capabilities of the SmartR Heat Map in a showcase with the GEO study GSE7621, which contains “Expression data of substantia nigra from postmortem human brain of Parkinson's disease patients (PD)” [1]. The study also contains patients without the disease, serving as a control group.

[1] Lesnick TG, Papapetropoulos S, Mash DC, Ffrench-Mullen J et al. A genomic pathway approach to a complex disease: axon guidance and Parkinson disease. PLoS Genet 2007 Jun;3(6):e98. PMID: 17571925

SFig. 5 The Comparison Tab. This is our first view when loading the web page. On the left side, our data are represented in a tree-like structure. On the right side we can, based on these data, define case and control group.

SFig. 6a The i2b2 Tree. For this showcase, we expand the data tree to expose case and control group of the GSE7621 study as shown in the figure.

SFig. 7 Defining case and control group. By dragging the respective nodes into the two subset windows, we define our cohorts for the following analysis. In our case, we want to compare PD subjects with the healthy control group.

SFig. 8 The SmartR Tab. Once our groups are defined, we switch to the SmartR tab and select the ‘Heatmap Workflow’ as shown.

SFig. 9a The Fetch Tab. After selecting a workflow, we are presented with several boxes. These boxes can be used to assign roles to our data nodes from the tree on the left. In this showcase we must define the expression data we wish to analyze, but are free to include clinical data, as we will see later.

SFig. 6b The i2b2 Tree. We further expand the tree to show the subject gender and the expression data.

SFig. 9b The Fetch Tab. Again, by using drag-and-drop, we define what data we wish to include in our analysis. Expression data are dragged to the left-most box and the two categorical gender nodes are dragged to the right-most. As we will see in a moment, this will allow us to relate subjects to their respective gender.

SFig. 9c The Fetch Tab. Now we press the button “Fetch Data” and wait a moment until all data are loaded and we can see a small summary of statistics. In the background, this step fetches the just-defined nodes from the database and assigns the subjects to the respective cohorts that we defined earlier.

SFig. 10 The Run Analysis Tab. Now we can switch to the “Run Analysis” tab, where it is possible to set several parameters to tweak the analysis script before creating the heat map. For this showcase, the default parameters are a good choice.

SFig. 11a The Heat Map. Once the analysis is finished, the dynamic heat map is created. Visible are the top genes according to a differential expression analysis executed based on the parameters of the previous figure. Features of the heat map are introduced as needed in the further course of this showcase.

SFig. 11b The Heat Map. In the lower right corner of the screen we can find several options that will modify our heat map on-the-fly. For now, we want to apply a hierarchical clustering to see if we can clearly separate case and control group. The option “Hierarch.-Eucl.-Avg.” will apply such a clustering with ‘Euclidean’ distance measure and ‘Average’ linking.
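To make the clustering option concrete, the following is a naive sketch of agglomerative clustering with Euclidean distance and average linkage, written in plain Python. It is not the code SmartR runs; the function name, the return format and the four toy expression profiles are assumptions for illustration only.

```python
from itertools import combinations
from math import dist

def average_linkage(profiles):
    """Cluster profiles bottom-up; return a list of merges.

    Each merge is (members_a, members_b, distance), where the members
    are tuples of indices into `profiles`.
    """
    clusters = [(i,) for i in range(len(profiles))]
    merges = []

    def linkage(a, b):
        # Average linkage: mean Euclidean distance over all member pairs.
        total = sum(dist(profiles[i], profiles[j]) for i in a for j in b)
        return total / (len(a) * len(b))

    while len(clusters) > 1:
        # Merge the two clusters that are closest to each other.
        a, b = min(combinations(clusters, 2), key=lambda pair: linkage(*pair))
        merges.append((a, b, linkage(a, b)))
        clusters = [c for c in clusters if c not in (a, b)] + [a + b]
    return merges

# Hypothetical expression profiles: subjects 0 and 1 resemble each other,
# as do subjects 2 and 3.
profiles = [[0.0, 0.0], [0.0, 1.0], [10.0, 0.0], [10.0, 2.0]]
merges = average_linkage(profiles)
top_split = merges[-1]  # the last merge is the root of the dendrogram
```

The last merge corresponds to the top-level split of the dendrogram, which is the split inspected when checking whether case and control group separate.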

SFig. 12 The Subject Dendrogram. Above the heat map we can see our cohorts, encoded in yellow and orange color, and the previously added gender nodes. We can deduce two facts from this image: 1. The clustering could clearly separate our cohorts into two groups. 2. The subject gender seems to have no correlation with how the data are grouped.

SFig. 13 The Row Dendrogram. In a similar fashion, the genes are well separated into two subgroups. For further analysis, we might want to attempt to associate these genes with a KEGG pathway. By clicking the marked dendrogram node, the contained leaves/genes are sent to an external service to handle this task (bioCompendium. The high-throughput experimental data analysis platform (2016). Retrieved from http://biocompendium.embl.de/).

SFig. 14 BioCompendium. Sure enough, we see an association of these genes with the PD Pathway. Clicking the red flag on the left side will bring us to the KEGG web service.

SFig. 15 KEGG Pathway. Associated genes are highlighted in an image of the pathway for further interpretation or analysis.

SFig. 16 Statistics representation. Back in the heat map, we have a look at the red bars on the left side. Based on our analysis parameters, these bars initially indicate the adjusted p-value for the respective gene. One gene shows an extraordinarily low p-value, represented by a long bar.

SFig. 17a Manual Sorting. By clicking the buttons around the heat map, we can manually sort by columns or rows. Sorting the columns based on the expression values for this single gene seems to have almost no impact.

SFig. 17b Manual Sorting. Sure enough, the previous observation is confirmed by the nearly perfect split between case and control group. This makes the gene a good candidate for further analysis. If one wishes to proceed from here, clicking the name of the gene will redirect the analyst to another service, where further gene-specific details are listed.
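Sorting the heat map columns by a single gene can be sketched as follows. The matrix layout (gene name mapped to one value per subject), the function name and all labels and values are hypothetical and only serve to illustrate the idea.

```python
def sort_columns_by_gene(matrix, subjects, gene):
    """Reorder all columns by ascending expression of one gene."""
    order = sorted(range(len(subjects)), key=lambda i: matrix[gene][i])
    sorted_subjects = [subjects[i] for i in order]
    sorted_matrix = {g: [row[i] for i in order] for g, row in matrix.items()}
    return sorted_subjects, sorted_matrix

# Hypothetical subjects and expression values.
subjects = ["PD_1", "CTRL_1", "PD_2", "CTRL_2"]
matrix = {
    "GENE_X": [8.1, 2.0, 7.5, 2.4],
    "GENE_Y": [1.0, 2.0, 3.0, 4.0],
}

sorted_subjects, sorted_matrix = sort_columns_by_gene(matrix, subjects, "GENE_X")
```

In this toy example the controls end up on one side and the PD subjects on the other, which is the kind of split between case and control group that the caption describes.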

SFig. 18 Statistics Table. Finally, most of the data are displayed below the heat map in table form. The table automatically adapts to the current sorting of the heat map rows to increase visibility. These data are also retrievable by clicking one of the buttons near the “Create Plot” button.