Transcript

kaitlin thaney

SciPy, 13 july 2011

austin, texas

the reality of ‘digital science’

Wednesday, 13 July 2011

xi. background


about me


Digital Science (the company)


investment arm

incubator role

in-house dev


tiered approach

build to scale

researcher-focused


1. science, tech, and moving online


research

idea

experiment

lit review discovery

materials

publish

share results

retest

analyze

collect data


blocking points

idea

experiment

lit review discovery

materials

publish

share results

retest

analyze

collect data

(to name a few ... )


access

analysis

dissemination

text text text


discovery & delivery


changes at the workbench


annotation & curation


social & administrative


gaps still exist


2. key constituencies


(3)


machines

researchers

decision makers


machines

researchers

decision makers


... annotation

markup

search

discovery

“behind the scenes” ...


[brief interlude]


digitisation of the scholarly canon

(content is still king)


not nearly there yet ...


barriers to “access”


still the starting point


patents are no better (in many cases, worse)


can streamline


name disambiguation


10,11-dihydro-5-methyl-5H-dibenzo[b,e][1,4]diazepin-11-one

(still strains the minds of the best)


machine readability is key.

agreement is hard.


10,11-dihydro-5-methyl-5H-dibenzo[b,e][1,4]diazepin-11-one


“everything is metadata ...

everything can be a label.”

- david weinberger


machines

researchers

decision makers


a few edge cases (though no field is perfect)


CC-BY-2.0 - Plaxco Lab - http://www.flickr.com/photos/34857812@N04/


tracking

expiration

calibration


the non-digital

+ordering

processing


protocols

parameters

calibration

misc. lit


managing information

different types of “data”


often gigabytes, not terabytes


“i invented a folder based system ...”


“i invented a folder based system ...”

“yeah, we had a LIMS. it only ever got used to store photos from lab nights out.”


why?

experimentation reliance

data moves, grows legs

funder/instit’n pressure


machines

researchers

decision makers


rewards, incentives

the “why”


data capture (of a different sort)


the “social issue”

best practices

behaviour roadblocks

discipline / researcher specific


paper’s still the currency


imperfect system


“Right now we're going through a Cambrian explosion of metrics.”

- Johan Bollen

Nature 465, 864-866 (2010) | doi:10.1038/465864a


there’s been a drastic spike in terms of sheer volume and type


citation / impact factor

h-index

weighted citations (eigenfactor, sjr)

“betweenness centrality”

alt-metrics, etc.
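
of the metrics listed here, the h-index has the simplest definition: the largest h such that h papers each have at least h citations. a minimal python sketch, assuming the citation counts are already available as a plain list (the function name and the sample counts are illustrative and do not come from the talk):

```python
def h_index(citations):
    """Return the largest h such that at least h papers have >= h citations each."""
    # Rank papers from most cited to least cited.
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            # The top `rank` papers all have at least `rank` citations.
            h = rank
        else:
            break
    return h


# Hypothetical citation counts for one researcher's papers.
example_counts = [10, 8, 5, 4, 3]
print(h_index(example_counts))
```

the harder part in practice is the input, not the arithmetic: different sources count citations differently, which is part of why these metrics are hard to harmonise.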


difficult to ... harmonise

track / maintain / map

understand

(even still measure)


administrators / funders =

part of the research cycle


3. the reality


“the future is here ... just not evenly distributed yet.”

- William Gibson


changing understandings, paradigms


technology can help

design decisions are key

plan for the irrational


more efficient research

increase productivity

enable reproducibility


thank you.

k.thaney@digital-science.com

www.digital-science.com

@kaythaney
