Top Banner
The The Web Web and and Websit Websit es es Along Along came a came a Spider! Spider!
33

Websites 2003 version

Dec 05, 2014

Download

Education

Johan Koren

 
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: Websites 2003 version

The Web The Web and and

WebsitesWebsites

Along came Along came a Spider!a Spider!

Page 2: Websites 2003 version

22

First: What is the Internet?First: What is the Internet?

For one thing, it’s not really “the For one thing, it’s not really “the net”, it’s the “netnet”, it’s the “netss”:”:– “the Internet: a cooperatively-run

collection of computer networks that span the globe.”

2

Page 3: Websites 2003 version

33

Is it the same as the Web?Is it the same as the Web?

Internet Internet ≠≠ World Wide Web World Wide Web– The Internet is a massive network of

networks, a networking infrastructure.

– The World Wide Web, or simply Web, is a way of accessing information over the medium of the Internet. It is an information-sharing model that is built on top of the Internet.

– The Internet, not the Web, is also used for e-mail, which relies on SMTP, Usenet news groups, instant messaging and FTP.

• The Difference Between the Internet and the World Wide Web

3

Page 4: Websites 2003 version

44

Cold War Technology?Cold War Technology?

Originally designed by the U.S. Department of Originally designed by the U.S. Department of Defense so that a communication signal could Defense so that a communication signal could withstand nuclear war and serve military institutions withstand nuclear war and serve military institutions worldwide, the Internet was first known as the worldwide, the Internet was first known as the ARPANet, the most , the most robust communication communication technology. It is a system of linked computer technology. It is a system of linked computer networks, international in scope, that facilitates , international in scope, that facilitates data transfer and communication services, such as remote transfer and communication services, such as remote login, login, file transfer file transfer ((FTP), electronic mail (), electronic mail (e-mail), ), newsgroups, and the , and the World Wide Web. The Internet . The Internet greatly extends the reach of each connected greatly extends the reach of each connected computer networkcomputer network (see: (see: network effect, , IP).).

– Internet

Page 5: Websites 2003 version

55

Before ARPANetBefore ARPANet

Before ARPANET, most Before ARPANET, most computer systems consisted of systems consisted of a massive computer -- sometimes the size of an entire a massive computer -- sometimes the size of an entire room -- with user terminals hardwired to it. A terminal room -- with user terminals hardwired to it. A terminal was some form of user interface, often consisting of a was some form of user interface, often consisting of a keyboard or punch card reader. Multiple users could or punch card reader. Multiple users could access the computer simultaneously, in a technique access the computer simultaneously, in a technique called timesharing. Other early called timesharing. Other early networks required a required a direct connection between host computers, meaning direct connection between host computers, meaning that there was only one path for information to flow that there was only one path for information to flow through. The direct connections limited the size of through. The direct connections limited the size of these computer networks, which became known as these computer networks, which became known as local area networks (LANs).local area networks (LANs).

– How ARPANET Works

Page 6: Websites 2003 version

66

Phone-linked networksPhone-linked networks ““In the 1960s, as many as a few hundred In the 1960s, as many as a few hundred

users could have accounts on a single large users could have accounts on a single large computer using terminals, and exchange computer using terminals, and exchange messages and files between them. But each messages and files between them. But each of those little communities was an island, of those little communities was an island, isolated from others. By reliably connecting isolated from others. By reliably connecting different kinds of computers to each other, different kinds of computers to each other, the ARPANET took a crucial step toward the ARPANET took a crucial step toward the online world that links nearly a third of the online world that links nearly a third of the world's population today.”the world's population today.”– Marc Weber, founding curator of the

Computer History Museum’s Internet History Program

On October 29, 2009, SRI celebrated the 40th anniversary of the first ARPANET connection

.

Page 7: Websites 2003 version

77From mainframes to minicomputersFrom mainframes to minicomputers

Before ARPANET, most Before ARPANET, most computer systems systems consisted of a massive computer -- sometimes the consisted of a massive computer -- sometimes the size of an entire room -- with user terminals size of an entire room -- with user terminals hardwired to it.hardwired to it.

Page 8: Websites 2003 version

88

Minicomputers?Minicomputers? Minicomputers are a largely obsolete class of s are a largely obsolete class of

multi-user multi-user computers which made up the which made up the middle range of the computing spectrum, in middle range of the computing spectrum, in between the largest multi-user systems between the largest multi-user systems (mainframe computers) and the smallest (mainframe computers) and the smallest single-user systems (microcomputers or single-user systems (microcomputers or personal computers))

Page 9: Websites 2003 version

99

When did ARPAnet become the Internet?When did ARPAnet become the Internet?

““. . . Ray Tomlinson is credited with inventing . . . Ray Tomlinson is credited with inventing email in 1972. . . . He picked the @ symbol from email in 1972. . . . He picked the @ symbol from the computer keyboard to denote sending the computer keyboard to denote sending messages from one computer to another. So then, messages from one computer to another. So then, for anyone using Internet standards, it was for anyone using Internet standards, it was simply a matter of nominating name-of-the-simply a matter of nominating [email protected]@name-of-the-computer.

““. . . 1975 seems to be the definitive year in which, . . . 1975 seems to be the definitive year in which, for the first time, networks connected to each for the first time, networks connected to each other.”other.”

– Ian Peter's History of the Internet

Page 10: Websites 2003 version

1010

1975, the net goes commercial1975, the net goes commercial

Telenet– One of the first value-added, packet

switching networks that enabled terminals and computers to exchange data. Established in 1975 by Dr. Lawrence Roberts, who helped to develop ARPANET, Telenet was acquired by GTE in 1979. After it was acquired by Sprint in 1986, it was renamed SprintNet

Page 11: Websites 2003 version

1111Telenet is not to be Telenet is not to be confused with Telnetconfused with Telnet

What is telnet? – Telnet and its close cousins rlogin and

tn3270 are methods of connecting to a remote [mainframe] computer over the Internet that let you use programs and data just as if you were using the computer locally. Do not confuse telnet with Telenet, the old name for Sprintnet.

– Telnet is a text-only protocol. At one time it was one of the most common ways to connect to other sites.

Page 12: Websites 2003 version

1212

Significance of TelSignificance of Teleenet?net?

Published 1991

Page 13: Websites 2003 version

1313

1980s1980s

Main uses scholarly or militaryMain uses scholarly or military– Libraries use networks like Telenet and

Tymnet for remote searching of databases– Scientists and scholars communicate by email

The Silent 700 was a line of portable computer terminals manufactured by Texas Instruments in the 1970s and 1980s. Silent 700s printed with a dot-matrix heating element onto a roll of heat-sensitive paper. They were equipped with an integrated acoustic coupler and modem that could receive data at 30 characters per second.

Page 14: Websites 2003 version

1414

What, no fun things?What, no fun things?

Enter the BBS!Enter the BBS!– Bulletin Board System

• A Bulletin Board System, or BBS, is a computer system running software that allows users to connect and log in to the system using a terminal program. Once logged in, a user can perform functions such as uploading and downloading software and data, reading news and bulletins, and exchanging messages with other users, either through electronic mail or in public message boards. Many BBSes also offer on-line games, in which users can compete with each other, and BBSes with multiple phone lines often provide chat rooms, allowing users to interact with each other.

Page 15: Websites 2003 version

1515

BBS all text, very little graphicsBBS all text, very little graphics

Monochrome BBS, known to users as “Mono”, was originally a student bulletin board system at [London’s] City University in the early 1990s. The BBS is still in existence with a web presence at http://www.mono.org/ from where you can connect to the real thing by telnet. See also Monochrome BBS – Definition.

Page 16: Websites 2003 version

1616

Related to BBSRelated to BBS

Gopher Gopher – The Gopher Protocol is a distributed document search

and retrieval protocol that was developed at the University of Minnesota in the late 1980s. Resources are stored on Gopher servers, which organize information using a hierarchical directory structure. Gopher clients access servers to retrieve directory listings of available resources, which are presented to the user as a menu from which an item may be selected for retrieval.

• Gopher Protocol (Gopher) (Page 4 of 4)

Page 17: Websites 2003 version

1717

A Gopher menuA Gopher menu

From a Finnish History of the Internet (click on 1991 to get the page where this is reproduced)To navigate the menus, you used the arrow keys (no mouse, of course!) to move the arrow up or down the menu and then hit Enter to select the item you want. Current browsers no longer support Gopher.

Page 18: Websites 2003 version

1818Veronica, Jughead and Veronica, Jughead and Archie (but not Betty)!Archie (but not Betty)!

Rodent companions!Rodent companions!– Veronica: “Very Easy Rodent-Oriented Net-wide Index to

Computerized Archives”

– Jughead: “Jonzy's Universal Gopher Hierarchy Excavation and Display”

– Archie: a popular FTP [“File Transfer Protocol”] search program of the time. Though the legend of Archie being named for the cartoon, the name in fact is shorthand for “Archives.”

• A Pre-Web Search Engine, Gopher Turns Ten By Chris Sherman, Search Engine Watch, Feb 6, 2002

Page 19: Websites 2003 version

1919Finally, The Web is added: Finally, The Web is added: 19911991

Tim Berners-Lee:– “. . . in 1989, while working at the European Particle

Physics Laboratory, I proposed that a global hypertext space be created in which any network-accessible information could be refered to by a single "Universal Document Identifier". Given the go-ahead to experiment by my boss, Mike Sendall, I wrote in 1990 a program called "WorldWideWeb", a point and click hypertext editor which ran on the "NeXT" machine. This, together with the first Web server, I released to the High Energy Physics community at first, and to the hypertext and NeXT communities in the summer of 1991.

• The World Wide Web: A very short personal history

He almost called it the “Mesh”!

Sir Tim Berners-Lee on the Web (past, present and future)

Page 20: Websites 2003 version

2020

The first “real” browserThe first “real” browser

NCSA MosaicIn 1991, the NCSA introduced NCSA Mosaic, the first readily-available graphical Web Browser that virtually kickstarted the dot.com revolution. It may not look like much now – but it is interesting to consider how similar modern browsers look to the original.PawPrint.net Glossary of Terms

Page 21: Websites 2003 version

2121

Today’s browsersToday’s browsers

Page 22: Websites 2003 version

2222

Alphabet soup of the WebAlphabet soup of the Web

URLURL– Uniform Resource Locator

HTTPHTTP– HyperText Transfer Protocol

HTMLHTML– Hypertext Markup Language– Now being complemented by

XML• EXtensible Markup Language

– See What is XML?

Page 23: Websites 2003 version

2323What does HTML or XML look like?What does HTML or XML look like?

Page 24: Websites 2003 version

2424

A free site to create your ownA free site to create your own

What is Weebly?What is Weebly?– Weebly is a San Francisco,

California based company that was founded in 2006 with the mission to help people put their information online quickly and easily. We now enable 3 million people to easily create personal sites and blogs or establish web presences for businesses, weddings, classrooms, churches, artistic portfolios, and more.

Page 25: Websites 2003 version

2525Basic URL Structure

PartsParts– A URL has three basic parts: the protocol (how to get the resource);

the server id (who to get the resource from); and the resource id (the name of the resource and how to find it on the target machine). In its most basic form, this looks like the following:

– The "http" indicates that this is a Web document. The "www.fake.com" is the domain name of the (in this case, fictional) machine on which the web server is running (we know it's a web server because of the protocol). And, of course, "doc.html" is the filename of the HTML document (notice the file extension ".html") on that machine.

Page 26: Websites 2003 version

2626

Domain name?Domain name?

What is a ‘Domain Name’?– Domain Name System, or DNS, is the most

recognized system for assigning addresses to Internet web servers (aka “Internet hosts”). Somewhat like international phone numbers, the domain name system helps to give every Internet server a memorable and easy-to-spell address. Simultaneously, the domain names keep the really technical IP address invisible for most viewers.

• By Paul Gil, About.com Guide

Page 27: Websites 2003 version

2727

Structure of a Domain Name 1Structure of a Domain Name 1

What is a Top Level Domain (TLD)?– A Top Level Domain (TLD) is the “suffix”

or the end of each domain name. (e.g., the “.com” in yahoo.com is the TLD.) There are two types of TLDs - global and country code.

Generic Top Level Domain (gTLDs) extensions include: .com, .net, .org, .biz, .coop, .edu, .gov, .info, .int, .mil, and .museum.

Page 28: Websites 2003 version

2828

Country code TLDsCountry code TLDs

– Country Code Top Level Domains (ccTLDs) are TLDs created by a country, such as .it, which is the country code for Italy or .tv which is the country code for Tuvalu, and of course .us for the United States

• A complete list of ccTLDs (sorted by ccTLD) can be found at http://www.iana.org/cctld/cctld-whois.htm.

oWhat is a Top Level Domain (TLD)?

Page 29: Websites 2003 version

2929

Structure of a Domain Name 2Structure of a Domain Name 2

Second-level domain– In the Domain Name System

(DNS) hierarchy, it is the highest level underneath the top-level domains. It is that portion of the domain name that appears immediately to the left of the top-level domain, separated by a dot. For example, the “NetLingo” in www.netlingo.com is a second-level domain.

Page 30: Websites 2003 version

3030

Structure of a Domain Name 3Structure of a Domain Name 3

SubDomain - The Third Level DomainSubDomain - The Third Level Domain– If you need to further distinguish your second-

level domain name, you can use a third-level domain name, such as “resources.hostway.com.” Typically a third-level domain name is used to refer to different servers within different departments of a company.

• Creating third-level domains

Page 31: Websites 2003 version

3131

Success of the web?Success of the web?

Tim Berners-Lee:Tim Berners-Lee:– The success of the World Wide Web, itself built on the open Internet, has

depended on three critical factors: 1) unlimited links from any part of the Web to any other; 2) open technical standards as the basis for continued growth of innovation applications; and 3) separation of network layers, enabling independent innovation for network transport, routing and information applications. Today these characteristics of the Web are easily overlooked as obvious, self-maintaining, or just unimportant. All who use the Web to publish or access information take it for granted that any Web page on the planet will be accessible to anyone who has an Internet connection, regardless whether it is over a dialup modem or a high speed multi-megabit per second digital access line. The last decade has seen so many new ecommerce startups, some of which have formed the foundations of the new economy, that we now expect that the next blockbuster Web site or the new homepage for your kid's local soccer team will just appear on the Web without any difficulty.

• Testimony of Sir Timothy Berners-Lee, Hearing on the “Digital Future of the United States: Part I -- The Future of the World Wide Web”

Page 32: Websites 2003 version

The Internet is a far more speech-enhancing medium than print, the village green, or the mails.... The Internet may fairly be regarded as a never-ending worldwide conversation.[1]

Statement by a federal judge in American Civil Liberties Union v. Reno, 929 F. Supp. 824, 844 (E.D. Pa. 1996) (Dalzell, J.). Quoted by Tim Berners-Lee in his Testimony at Hearing on the “Digital Future of the United States: Part I -- The Future of the World Wide

Web”

Page 33: Websites 2003 version

3333