Page 1: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Boolean, bibliometrics, and beyond

LIS 670

donna Bair-Mundy

Part 2

Page 2: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Bibliometrics

Page 3: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Bibliometrics – a definition

Using quantitative analysis and statistics to examine patterns in academic publishing, now including information transmitted via the World Wide Web

Page 4: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Bibliometrics – what it looks at

• Author productivity

• Citation analysis – impact factors, indexing

• Obsolescence of information resources – half-life of articles

• Dispersion of articles in certain fields

• Word frequencies

Page 5: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Bibliometrics – Purposes (1)

Provide evolutionary models of science, technology, and scholarship

Invisible colleges

Structure of scholarly disciplines

Evolution of a discipline over time

Evolution of concepts

[Diagram: Physics branching into Astrophysics, Biophysics, and Subatomic particle physics]

Global warming

Page 6: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Bibliometrics – Purposes (2)

Assist development of information retrieval methodologies

Provide tools for studying information use and impact

Assist in selection and deselection of resources

Page 7: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Properties of scientific literature

Fragmentary - each paper contributes a small piece to the puzzle under study

Derivative - scientific papers rely heavily on previous research (acknowledged in citations)

Edited - peer reviewed by anonymous referees

Page 8: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Evolution of a discipline

• Purpose: "to reduce to geometric form the activities of the corporate body of anatomical research, and the relative importances from time to time of each country and division of the subject"

• Looked at 6,436 publications dealing with animal anatomy for the period 1543 to 1860

Cole and Eales - 1917 - The history of comparative anatomy—a statistical analysis of the literature

Published in: Sci. Progr. 11:578-596.

Page 9: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Evolution of a discipline

• When were the periods of greater or lesser importance?

• Where were the centers of activity at any given time?

• As the field grew, how and when did it begin to be subdivided into narrower fields?

Looking at publications within a field to tell us about the field itself

Cole and Eales - 1917 - The history of comparative anatomy—a statistical analysis of the literature

Page 10: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Evolution of a discipline: IS

• Emergence and development of information science

• Relationships and roles of information science within potentially emergent suprasystem of knowledge

Harmon, Glynn - 1971 – On the evolution of information science. JASIS 22(4):235-241

Page 11: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Science, politics, and economics

First to use the term "statistical bibliography"

E. Wyndham Hulme 1923 - Statistical bibliography in relation to the growth of modern civilization

Published by Butler and Tanner Grafton (London)

Purpose: "to ascertain and illustrate by bibliographical data, various stages in the development of the mechanics of civilization"

Page 12: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Hulme (cont’d)

Used 13 annual issues of The International Catalogue of Scientific Literature, from 1901 to 1913

Counted author entries for various subjects

Tabulated number of indexed journals by countries (which countries are highly productive in science?)

Page 13: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Hulme (cont’d)

Felt that subject division in a discipline was a sign of growth

Concluded that scientific publication output is influenced by population change and political and economic movements

Page 14: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Research output by countries

J. Martin van Zyl 2013 – The generalized Pareto distribution fitted to research outputs of countries. Scientometrics 94(3):1099-1109

Which continent (besides Antarctica) is not represented?

Why might that be?

What might be the consequences?

Page 15: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Cost of research

Page 16: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Consequences

ebola: 722 results
ebolavirus: 984 results
aids: 122,722 results
hiv: 196,414 results

Page 17: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Author productivity

Purpose: to "determine, if possible, the part which men of different calibre contribute to the progress of science"

Alfred J. Lotka 1926 - Statistics—the frequency distribution of scientific productivity

Published in: J. Washington Acad. Sci. 16:317-325.

Looked at Chemical Abstracts Index, then Geschichtstafeln der Physik

Page 18: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Lotka's Law

The number of authors y in a given subject who each produce x publications is inversely proportional to x raised to some power n.

Page 19: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Lotka's Law - scientific publications

Inverse square law of scientific productivity

$x^n \cdot y = C$

Where:
x = number of publications
y = number of authors credited with x publications
n = constant (equals 2 for scientific subjects)
C = constant
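As a rough illustration of the inverse square law, the sketch below solves the equation for y. It assumes that C can be read off as the number of authors with exactly one publication (setting x = 1 gives y = C). The function name and the figure of 100 single-paper authors are hypothetical, not taken from Lotka's data.

```python
def lotka_expected_authors(single_paper_authors: int,
                           max_publications: int,
                           n: float = 2.0) -> dict[int, float]:
    """Expected number of authors y credited with x publications.

    Rearranges x^n * y = C to y = C / x^n. With x = 1 the law gives
    y = C, so C is taken to be the number of one-paper authors.
    """
    c = float(single_paper_authors)
    return {x: c / x ** n for x in range(1, max_publications + 1)}


# Hypothetical field in which 100 authors each published one paper.
expected = lotka_expected_authors(100, 4)
for x, y in expected.items():
    print(f"{x} publ.: about {y:.2f} authors")
```

With n = 2, the number of authors with two publications is one quarter of the number with one, and the number with three is one ninth.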

Page 20: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Lotka's Law - scientific publications

$x^n \cdot y = C$

[Chart: no. of authors (vertical axis) plotted against no. of publications: 1 publ., 2 publ., 3 publ., 4 publ. (horizontal axis)]

Page 21: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Relative impacts of journals

Purpose: Select appropriate journals for a chemical library to provide good education for students

Gross & Gross - 1927 - College libraries and chemical education

Published in: Science 66:385-389

Tabulated 3,633 citations found in the 1926 volume of the Journal of the American Chemical Society

First use of citation analysis rather than publication counts

Which journals to collect?

Page 22: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Relative impacts of journals

Journal Citation Reports

“JCR is still the only usable tool to rank thousands of scholarly and professional journals...” Peter Jacso

Page 23: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Relative impacts of journals

Journal Citation Reports

Page 24: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Relative impacts of journals

Journal Citation Reports

Page 25: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Relative impacts of journals

Journal Citation Reports

Page 26: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Relative impacts of journals

Journal Citation Reports

Page 27: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Citation Indexing

Eugene Garfield 1955 - Citation indexes for science: a new dimension in documentation through association of ideas

Impact factor: influence of an article based on citations to it

Published in: Science 122:108-111.

Science Citation Index

Page 28: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Problems of indexing

The interrelationship between the chemistry and the biological organisms of the soils of Cambodia.

The soil ecology of Kampuchea

1955 1995

Page 29: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Citation matrix

[Diagram: one article shown with three cited articles and seven citing articles]

Page 30: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

ISI Web of Science (1)

Page 31: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

ISI Web of Science (2)

Page 32: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

ISI Web of Science (3)

Page 33: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

ISI Web of Science (4)

Page 34: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

ISI Web of Science (5)

Page 35: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Science Citation Index

[Diagram: one article shown with three cited articles and seven citing articles]

Association-of-ideas index

http://libweb.hawaii.edu/uhmlib/databases/er_title.html#WEB

Page 36: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Co-citation analysis

Articles that cite the same article are likely to both be of interest to the reader of the cited article

[Diagram: one article with two citing articles]

These two articles are likely to be related
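The idea on this slide can be sketched in a few lines: group citing articles by the article they cite, then pair up the citing articles within each group. This is a minimal illustration only. The function name and the article labels are hypothetical, and the input is assumed to be a plain list of (citing, cited) pairs.

```python
from collections import defaultdict
from itertools import combinations


def related_pairs(citations):
    """Map each pair of citing articles to the cited articles they share.

    citations: iterable of (citing_article, cited_article) tuples.
    """
    # Collect, for every cited article, the set of articles that cite it.
    citing_by_cited = defaultdict(set)
    for citing, cited in citations:
        citing_by_cited[cited].add(citing)

    # Any two articles in the same set cite a common article.
    shared = defaultdict(set)
    for cited, citers in citing_by_cited.items():
        for first, second in combinations(sorted(citers), 2):
            shared[(first, second)].add(cited)
    return dict(shared)


# Hypothetical data: B and C both cite A, while D cites E only.
sample = [("B", "A"), ("C", "A"), ("D", "E")]
for pair, common in related_pairs(sample).items():
    print(pair, "both cite", sorted(common))
```

The more cited articles a pair shares, the stronger the suggested relationship, so the size of each set could serve as a simple similarity score.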

Page 37: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Selecting productive journals

Samuel Clement Bradford 1934 - Sources of information on specific subjects

Purpose: to develop a means by which librarians could select the most usable periodicals

Published in: Engineering 137:85-86

First paper published on observations of scattering

Bradford's Law

Page 38: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Bradford's Law of Scattering (1)

"If scientific journals are arranged in order of decreasing productivity of articles on a given subject, they may be divided into a nucleus of periodicals more particularly devoted to the subject and several groups or zones containing the same number of articles as the nucleus, when the numbers of periodicals in the nucleus and succeeding zones will be as $1 : n : n^2 : n^3$ …"

Page 39: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Bradford's Law of Scattering (2)

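No. of source journals | No. of articles per source | Total no. of articles
1 | 60 | 60
2 | 35 | 70
Subtotal: 3 sources, 130 articles
1 | 30 | 30
2 | 25 | 50
2 | 9 | 18
4 | 8 | 32
Subtotal: 9 sources, 130 articles
10 | 6 | 60
7 | 5 | 35
5 | 4 | 20
5 | 3 | 15
Subtotal: 27 sources, 130 articles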

Page 40: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Bradford's Law of Scattering (3)

3 sources: 130 articles

9 sources: 130 articles

27 sources: 130 articles
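The zone split above (3, 9, and 27 sources, each with 130 articles) can be reproduced from the counts on the previous slide. The sketch below is one plausible way to do the partition, assuming journals are ranked by productivity and each zone is closed as soon as it holds its equal share of articles. The function name is hypothetical.

```python
def bradford_zones(rows, zone_count=3):
    """Split journals into zones holding roughly equal numbers of articles.

    rows: list of (number_of_sources, articles_per_source) tuples.
    Returns one list of per-journal article counts for each zone.
    """
    # Expand the table to one article count per journal, most productive first.
    per_journal = sorted(
        (articles for sources, articles in rows for _ in range(sources)),
        reverse=True,
    )
    target = sum(per_journal) / zone_count  # articles each zone should hold

    zones, current, running = [], [], 0
    for articles in per_journal:
        current.append(articles)
        running += articles
        # Close the zone once it reaches its share; the last zone takes the rest.
        if running >= target and len(zones) < zone_count - 1:
            zones.append(current)
            current, running = [], 0
    zones.append(current)
    return zones


# Counts from the slide: (no. of source journals, no. of articles per source).
table = [(1, 60), (2, 35), (1, 30), (2, 25), (2, 9), (4, 8),
         (10, 6), (7, 5), (5, 4), (5, 3)]
for number, zone in enumerate(bradford_zones(table), start=1):
    print(f"zone {number}: {len(zone)} sources, {sum(zone)} articles")
```

The source counts 3 : 9 : 27 stand in the ratio 1 : 3 : 9, so n is 3 for this data set.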

Page 41: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

George Kingsley Zipf 1935

The psycho-biology of language: an introduction to dynamic philology

Frequency distributions of words

Published by MIT Press

Two laws

Frequently occurring words

Less frequently occurring words

Page 42: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Zipf's Law of High Frequency Words

For a given text the rank of a word multiplied by the frequency is a constant.

Proposed in 1949 by George Kingsley Zipf

$r \cdot f = c$

Where:
r = rank (in terms of frequency)
f = frequency (no. of times the given word is used in the text)
c = constant for the given text
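To check the law against a real text, one can count words, rank them by frequency, and look at the product of rank and frequency. The sketch below is a minimal version of that check. The function name, the tokenizing rule (lowercase runs of letters and apostrophes), and the sample sentence are all assumptions made for illustration. Words with equal counts simply receive consecutive ranks.

```python
import re
from collections import Counter


def rank_frequency(text):
    """Return (rank, word, frequency, rank * frequency) rows for a text."""
    # Assumed tokenizer: lowercase runs of letters and apostrophes.
    words = re.findall(r"[a-z']+", text.lower())
    ranked = Counter(words).most_common()  # highest frequency first
    return [
        (rank, word, freq, rank * freq)
        for rank, (word, freq) in enumerate(ranked, start=1)
    ]


# Hypothetical sample; a short sentence is far too small to show the law.
sample = "the cat and the dog and the bird"
for rank, word, freq, product in rank_frequency(sample):
    print(rank, word, freq, product)
```

On a long text, the products for the high-frequency words should stay in roughly the same range if the law holds.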

Page 43: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Application of Zipf's laws

Determine transition point between high- and low-frequency words

William Goffman - automatic indexing

Collect equal number of words above and below the transition point

Eliminate trivial words using stop list

Remaining content-bearing words indicate document contents
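The steps above can be sketched as a small selection routine. The slide does not say how the transition point is calculated, so the sketch takes it as a given frequency value. The function name, the `window` size, and the sample words and stop list are hypothetical.

```python
from collections import Counter


def candidate_index_terms(words, transition_freq, window, stop_words):
    """Pick content-bearing words around a frequency transition point.

    transition_freq: frequency treated as the transition point (assumed
    to be supplied by the caller; the slide gives no formula for it).
    window: how many ranked words to collect on each side of that point.
    """
    ranked = Counter(words).most_common()  # (word, freq), highest first
    # Position of the first word at or below the transition frequency.
    split = next(
        (i for i, (_, freq) in enumerate(ranked) if freq <= transition_freq),
        len(ranked),
    )
    start = max(0, split - window)
    selected = ranked[start:split + window]
    # Drop trivial words using the stop list.
    return [word for word, _ in selected if word not in stop_words]


# Hypothetical word counts for a short document.
words = (["the"] * 6 + ["of"] * 5 + ["soil"] * 4
         + ["ecology"] * 3 + ["cambodia"] * 2 + ["a"])
print(candidate_index_terms(words, 3, 2, {"the", "of", "a"}))
```

The words that survive the stop list are the candidates for describing the document's contents.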

Page 44: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Obsolescence of resources

Charles F. Gosnell 1944 - Obsolescence of books in college libraries

Purpose: "to discover lines of trend or curves of distribution by means of which this rate of obsolescence may be expressed in mathematical form"

Published in: College Res. Libr. 5:115-125

Page 45: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Curve of obsolescence

[Chart: number of users (vertical axis) plotted against age at time of use (horizontal axis)]

Page 46: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Alan Pritchard 1969

Statistical bibliography or bibliometrics?

Coined the term "bibliometrics": "the application of mathematics and statistical methods to books and other media of communication"

Published in: Journal of Documentation 25(4):348-349

Page 47: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Google indexing criteria

Text within page being indexed to determine topic

Links to page being indexed

Anchor text of links to page being indexed (indication of topic)

Weight links to page being indexed by links to the linking pages

“For a good explanation of Bradford’s Law of Scattering see...”

Page 48: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Google

Treating links as citations to compute PageRank

high-weight linkage

low-weight linkage
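The weighting idea can be illustrated with a simplified PageRank calculation. This is a sketch of the textbook power-iteration form, not Google's production method. The damping value of 0.85, the iteration count, the function name, and the four-page link graph are assumptions for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank by repeated redistribution of rank.

    links: dict mapping each page to the list of pages it links to.
    """
    pages = set(links) | {t for targets in links.values() for t in targets}
    rank = {page: 1.0 / len(pages) for page in pages}

    for _ in range(iterations):
        # Every page starts each round with a small baseline share.
        new_rank = {page: (1.0 - damping) / len(pages) for page in pages}
        for page in pages:
            targets = links.get(page, [])
            if targets:
                # A page divides its weight among the pages it links to.
                share = damping * rank[page] / len(targets)
                for target in targets:
                    new_rank[target] += share
            else:
                # A page with no outgoing links spreads its weight evenly.
                share = damping * rank[page] / len(pages)
                for other in pages:
                    new_rank[other] += share
        rank = new_rank
    return rank


# Hypothetical graph: A, B, and D all link to C, and C links to A.
ranks = pagerank({"A": ["C"], "B": ["C"], "C": ["A"], "D": ["C"]})
for page, score in sorted(ranks.items(), key=lambda item: -item[1]):
    print(page, round(score, 3))
```

In this graph A receives only one link, but it comes from the heavily linked page C, so A ends up ranked well above B and D. That is the high-weight versus low-weight linkage distinction.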

Page 49: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Citation tree rings represent the citation history of an article. The color of a citation ring denotes the time of corresponding citations. The thickness of a ring is proportional to the number of citations in a given time slice. Chen, C. 2006. CiteSpace II: detecting and visualizing emerging trends and transient patterns in scientific literature. Journal of the American Society for Information Science and Technology 57(3):359-377.

Page 50: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Bibliometrics in Action

A time-zone view of mass-extinction research. Chen, C. 2006. CiteSpace II: detecting and visualizing emerging trends and transient patterns in scientific literature. Journal of the American Society for Information Science and Technology 57(3):359-377.

Page 51: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.
Page 52: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Adding bibliometric visualizations to digital library search results

Page 53: Boolean, bibliometrics, and beyond LIS 670 donna Bair-Mundy Part 2.

Adding bibliometric visualizations to digital library search results