Photo: The "new" Old Shambles http://www.geograph.org.uk/photo/42671 taken by Keith Williamson http://www.geograph.org.uk/profile/104 The changing landscape of search: essential new tools for finding information John Rylands University Library Wednesday, 14 th July 2010 Presenter: Karen Blakeman
UKeiG workshop held on 14th July 2010, at the John Rylands University Library, Manchester.
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Photo: The "new" Old Shambles http://www.geograph.org.uk/photo/42671 taken by Keith Williamson http://www.geograph.org.uk/profile/104 This presentation is licensed under a Creative Commons Attribution 3.0 License
The changing landscape of search: essential new tools for finding informationJohn Rylands University Library
Wednesday, 14th July 2010 Presenter: Karen Blakeman
• By default, the main search tools look for all of your terms in a page
• Use double quote marks around phrases or names
• To exclude pages containing a term, precede the term with a minus sign (-) but use with care
• Focus your search in areas of the page- use advanced search screens or commands
- inurl: for example UK inurl:carbonemissions
• looks for your terms/phrase in the URL
- intitle: or allintitle: for example UK intitle:"carbon emissions"
• looks for your terms/phrase in the title of the page
11 April 2023 Karen Blakeman www.rba.co.uk 16
General search techniques
• Search sites or domains using the advanced search screen or site: command– wind turbine energy generation site:statistics.gov.uk
– wind turbine energy generation site:gov.uk
• Imagine what you would like to appear in your ideal document and include those terms in your strategy- renewable energy generation UK wind solar wave hydro
• Repeat your key search terms in your search strategy– tar sands EROEI– tar sands EROEI EROEI EROEI
• Change the order of your terms– biodiesel fuel UK buses efficiency – biodiesel fuel efficiency UK buses
11 April 2023 Karen Blakeman www.rba.co.uk 17
File format search
• Use advanced search options to limit your search to file
types or format:
– pdf or doc for government or industry/market reports
– xls for data and statistics
– ppt or pdf for presentations, experts on a topic
• Use Google and Yahoo Advanced Search
• Bing.com sometimes picks up unique content
– use command filetype: in your search strategy e.g.
• Is your web history switched on? Now Switched on by default
• Google adjusts your search results according to what you have clicked on in the past
• See: Personalized Search for everyone http://googleblog.blogspot.com/2009/12/personalized-search-for-everyone.html and Karen Blakeman's Blog - Your Google results are about to get weirder http://www.rba.co.uk/wordpress/2009/12/17/your-google-results-are-about-to-get-weirder/
• Additional search features previously hidden under “Show options” now in left hand side menu of your results page
11/04/23 www.rba.co.uk 26
?
Unique Google search features
• Similar or related pages– looks for pages that are similar in type and content
– click on the Similar link next to the page in your results list
– or use the related: command e.g. related:www.bwea.com
11/04/23 www.rba.co.uk 27
Unique Google search features
• Automatically looks for variations on your terms
– to stop it, precede your terms with plus signs e.g. +Norne oil field to stop Google looking for ‘Horne’
– or use double quote marks around the term or phrase “Norne”
• Numeric range search
– can be weights, distances, years, prices, measurements of any sort
– use Advanced Search screen
– or the search box on the Google home page
– search term(s) first value..second value unit of measurement
– toblerone 1..6 kg
– world oil production forecasts 2010..2015
11 April 2023 Karen Blakeman www.rba.co.uk 28
Unique Google search features
• Proximity– use the asterisk (*) to stand in for one or more terms
– useful when searching for people
– separates the terms by one or more words
– Karen * Blakeman picks up:
• Karen Blakeman
• Karen Mary Blakeman
• Karen Sands Blakeman
• Karen Marie Blakeman
– solar * panels picks up:
• solar heating panels
• solar electricity panels
• solar thermal panel
• solar voltaic panels
• solar concentrator panels
11 April 2023 Karen Blakeman www.rba.co.uk 29
Yahoo!• http://www.yahoo.co.uk/ • http://search.yahoo.co.uk/ http://search.yahoo.com/ • Deal with Bing• Some country versions already using Bing web search, rest
will follow during the year• Now includes options in the left side menu for limiting your
search (vary depending on type of search and country version of Yahoo)
• Not comprehensive and omits many key scientific publications
• Both peer-reviewed and un-reviewed articles, pre-prints, institutional repositories, references to books, citations
• Does not use publishers’ meta data
• Author search unreliable
• Search on year of publication unreliable
• “Google Scholar is brain damaged”Peter Jasco, Trends in Professional and Academic Online Information Services, presented at INFORUM, 22nd May 2007, Prague
Use with caution and be aware that there are huge gaps in coverage
iSEEK• http://www.iseek.com/ • Clusters results into topics, people, places, organisations, date & time• “Education” option – more research oriented pages
• A chemistry search engine aggregating & indexing chemical structures and their associated information into a single, free of charge, searchable repository
• Users encouraged to correct errors in data from peer reviewed journals, patents, FDA etc.
• Does ChemSpider Have Millions of Errors?http://www.chemspider.com/blog/does-chemspider-have-millions-of-errors.html
Guardian Data Blog http://www.guardian.co.uk/news/datablog Guardian World Government Data http://www.guardian.co.uk/world-government-data - a better way of searching government statistics sites!
Twitter Greasemonkey script add-on in my personal Firefox – adds 5 latest tweets to top of all Google searches. http://mt-hacks.com/20090302-realtime-twitter-search-results-on-google.html
• Look for groups or pages for an organisation (are they genuine?)
• Some professional groups
• Pages now used extensively to market organisations and services and for market research
• Reputation monitoring and management
11 April 2023 Karen Blakeman www.rba.co.uk 85
LinkedIn.com
• Check up on individuals, companies, groups
11 April 2023 Karen Blakeman www.rba.co.uk 86
Cluuz
• http://www.cluuz.com/
• “Cluuz … core technology understands the relationship between the entities, terms, or persons searched leading to more relevant, easy to understand search results”
• Network visualisation - easier to see connections between individuals and organisations than reading the text of a 10-20 web pages
• Results may change from one day to the next, one hour to the next
11 April 2023 Karen Blakeman www.rba.co.uk 87
Cluuz
11 April 2023 Karen Blakeman www.rba.co.uk 88
People networks & Search Tools
• Beware mash-ups– a website or application that combines content from more
than one source to generate a new page or resource
– usually automated with minimal human input or control
– For example:
• Zoominfo
• http://www.zoominfo.com/
• uses multiple web sources to generate profiles of people and networks
• subject of a profile can update or correct their profile but no checking done by Zoominfo
11 April 2023 Karen Blakeman www.rba.co.uk 89
123People.com
• Searches
– image sections of major search engines
– Flickr
– Facebook
– MySpace
– LinkedIn
– blogs
– web
– videos
– news
11 April 2023 Karen Blakeman www.rba.co.uk 90
The People Search Engine – Whoozy.com
11 April 2023 Karen Blakeman www.rba.co.uk 91
Pipl http://www.pipl.com/
“Pipl's query-engine helps you find deep web pages that cannot be found on regular search engines.
Pipl is designed to retrieve information from the deep web, our robots are set to interact with searchable databases and extract facts, contact details and other relevant information from personal profiles, member directories, scientific publications, court records and numerous other deep-web sources.”
11/04/23 www.rba.co.uk 92
Search tools for social media• Excellent presentation from Phil Bradley covers the best
– http://www.slideshare.net/Philbradley/social-media-search-engines – “20 search engines that each add something into social media
search, and they’re all worth exploring in some detail”
• My two personal favourites from his list– http://www.icerocket.com/– http://www.addictomatic.com/
11/04/23 www.rba.co.uk 93
Netvibes.com• New dashboard feature (must be signed in to your Netvibes account)
– type in your keywords and Netvibes automatically searches across news, social media, videos etc.
– looks in places you might not think of
11/04/23 www.rba.co.uk 94
Design your own search engine
• For
– regularly searched sites
– selected sites on a topic
– searching sites on a reading list
• Google Custom Search Engines
– http://www.google.com/cse/
– at least hundreds of sites, maybe thousands!
– can import lists of sites
• Need a Google account to set one up
• Cannot search password protected sources or sites where you have to fill in a form to access the information
11 April 2023 Karen Blakeman www.rba.co.uk 95
Create your own Google CSE
11 April 2023 Karen Blakeman www.rba.co.uk 96
Host it on Google
• Keep it private or make it public
• URLs are horribly long but you can create a TinyURL, Bit.ly URL etc. to make it easier for your users
11 April 2023 Karen Blakeman www.rba.co.uk 97
Disappearing pages
Search engine cache copies– Google, Yahoo, Bing
Firefox users– install the Resurrect Pages add-on
UK Web Archive http://www.webarchive.org.uk/ukwa/ Wayback machine
– http://www.archive.org/– from 1996 to about 1 year ago– navigate the archived site or type in the full URL of the document if
known
27 November 2006 Karen Blakeman www.rba.co.uk 9811 April 2023 Karen Blakeman www.rba.co.uk 98
Wayback Machine
11 April 2023 Karen Blakeman www.rba.co.uk 99
Further information & keeping up to date
• Information World Review (IWR) and IWR Blog– http://www.iwr.co.uk/, http://blog.iwr.co.uk/
• Phil Bradley’s Blog– http://philbradley.typepad.com/