Module 4 - Reading5 - UniformResourceLocator

A uniform resource locator (URL) specifies the location and retrieval method of a resource on a computer network. A URL identifies a web page, file, email address, or other application. It consists of components like the protocol, domain name, path, port number, query string, and fragment identifier. URLs were standardized in 1994 and allow resources to be located and accessed across the internet.

Uploaded by

Christopher Advincula

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

95 views

Module 4 - Reading5 - UniformResourceLocator

Uploaded by

Christopher Advincula

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 7

Reading: Uniform Resource Locator

Introduction

A uniform resource locator (URL) is a reference to a

resource that specifies the location of the resource on a
computer network and a mechanism for retrieving it. A URL
is a specific type of uniform resource identifier
(URI), although many people use the two terms
interchangeably. A URL implies the means to access an
indicated resource, which is not true of every URI. URLs
occur most commonly to reference web pages (http), but are
also used for file transfer (ftp), email (mailto), database
access (JDBC), and many other applications.
Most web browsers display the URL of a web page above
the page in an address bar. A typical URL has the
form http://www.example.com/index.html, which indicates
the protocol type (http), the domain name,
(www.example.com), and the specific web page (index.html).
History
The Uniform Resource Locator was standardized in 1994 by
Tim Berners-Lee and the URI working group of the Internet
Engineering Task Force (IETF) as an outcome of
collaboration started at the IETF Living Documents “Birds of
a Feather” session in 1992. The format combines the pre-
existing system of domain names (created in 1985) with file
path syntax, where slashes are used to separate directory
and file names. Conventions already existed where server
names could be prepended to complete file paths, preceded
by a double-slash (//).
Berners-Lee later regretted the use of dots to separate the
parts of the domain name within URIs, wishing he had used
slashes throughout. For
example,http://www.example.com/path/to/name would have
been written http:com/example/www/path/to/name. Berners-
Lee has also said that, given the colon following the URI
scheme, the two slashes before the domain name were also
unnecessary.
Syntax
Every HTTP URL consists of the following, in the given
order. Several schemes other than HTTP also share this
general format, with some variation.
• the scheme name (commonly called protocol, although not
every URL scheme is a protocol, e.g. mailto is not a
protocol)
• a colon, two slashes,
• a host, normally given as a domain name For example,
http://www.example.com/path/to/name would have
been written http:com/example/www/path/to/name but
sometimes as a literal IP address
• optionally a colon followed by a port number
• the full path of the resource
The scheme says how to connect, the host specifies where
to connect, and the remainder specifies what to ask for.
For programs such as Common Gateway Interface (CGI)
scripts, this is followed by a query string, and an optional
fragment identifier.
The syntax is:
scheme://[user:password@]domain:port/path?
query_string#fragment_id
Component details:
• The scheme, which in many cases is the name of a
protocol (but not always), defines how the resource will
be obtained. Examples include http, https, ftp, file and
many others. Although schemes are case-insensitive,
the canonical form is lowercase.
• The domain name or literal numeric IP address gives the
destination location for the URL. A literal numeric IPv6
address may be given, but must be enclosed in [ ] e.g.
[db8:0cec::99:123a]. The domain google.com, or its
numeric IP address 173.194.34.5, is the address of
Google’s website.
• The domain name portion of a URL is not case sensitive
since DNS ignores case: http://en.example.org/ and
HTTP://EN.EXAMPLE.ORG/ both open the same page.
• The port number, given in decimal, is optional; if omitted,
the default for the scheme is used. For example,
http://vnc.example.com:5800 connects to port 5800 of
vnc.example.com, which may be appropriate for a VNC
remote control session. If the port number is omitted for
an http: URL, the browser will connect on port 80, the
default HTTP port. The default port for an https: request
is 443.
• The path is used to specify and perhaps find the resource
requested. This path may or may not describe folders
on the file system in the web server. It may be very
different from the arrangement of folders on the web
server. It is case-sensitive, though it may be treated as
case-insensitive by some servers, especially those
based on Microsoft Windows. If the server is case
sensitive and http://en.example.org/wiki/URL is correct,
then http://en.example.org/WIKI/URL or
http://en.example.org/wiki/url will display an HTTP 404
error page, unless these URLs point to valid resources
themselves.
• The query string contains data to be passed to software
running on the server. It may contain name/value pairs
separated by ampersands, for example ?
first_name=John&last_name=Doe.
• The fragment identifier, if present, specifies a part or a
position within the overall resource or document. When
used with HTML, it usually specifies a section or
location within the page, and used in combination with
Anchor elements or the “id” attribute of an element, the
browser is scrolled to display that part of the page.
The scheme name defines the namespace, purpose, and the
syntax of the remaining part of the URL. Software will try to
process a URL according to its scheme and context. For
example, a web browser will usually dereference the URL
http://example.org:80 by performing an HTTP request to the
host at example.org, using port number 80.
Other examples of scheme names include https, gopher,
wais, ftp. URLs with https as a scheme (such as
https://example.com/) require that requests and responses
will be made over a secure connection to the website. Some
schemes that require authentication allow a username, and
perhaps a password too, to be embedded in the URL, for
exampleftp://[email protected]. Passwords
embedded in this way are not conducive to security, but the
full possible syntax is
scheme://username:password@domain:port/path?
query_string#fragment_id
Other schemes do not follow the HTTP pattern. For
example, the mailto scheme only uses valid email
addresses. When clicked on in an application, the URL
mailto:[email protected] start an e-mail composer with
the address [email protected] in the To field. The tel
scheme is even more different; it uses the public switched
telephone network for addressing, instead of domain names
representing Internet hosts.
List of allowed URL characters
Unreserved
The alphanumerical upper and lower case character may
optionally be encoded:
ABCDEFGHIJKLMNOPQRSTUVWXYZ
abcdefghijklmnopqrstuvwxyz
0123456789–_.~
Reserved
Special symbols must sometimes be percent-encoded:
! * ‘ ( ) ; : @ & = + $ , / ? % # [ ]
Further details can for example be found in RFC 3986 and
http://www.w3.org/Addressing/URL/uri-spec.html.
Relationship to URI
A URL is a URI that, in addition to identifying a web
resource, provides a means of locating the resource by
describing its “primary access mechanism (e.g., its network
location)”.
Internet hostnames
A hostname is a domain name assigned to a host computer.
This is usually a combination of the host’s local name with its
parent domain’s name. For example, en.example.org
consists of a local hostname (en) and the domain name
example.org. The hostname is translated into an IP address
via the local hosts file, or the domain name system (DNS)
resolver. It is possible for a single host computer to have
several hostnames; but generally the operating system of
the host prefers to have one hostname that the host uses for
itself.
Any domain name can also be a hostname, as long as the
restrictions mentioned below are followed. For example, both
“en.example.org” and “example.org” can be hostnames if
they both have IP addresses assigned to them. The domain
name “xyz.example.org” may not be a hostname if it does
not have an IP address, but “aa.xyz.example.org” may still
be a hostname. All hostnames are domain names, but not all
domain names are hostnames.
URL protocols
The protocol, or scheme, of a URL defines how the resource
will be obtained. Two common protocols on the web are
HTTP and HTTPS. For various reasons, many sites have
been switching to permitting access through both the HTTP
and HTTPS protocols. Each protocol has advantages and
disadvantages, including for some of the users that one or
the other protocol either does not function, or is very
undesirable. When a link contains a protocol specifier it
results in the browser following the link using the specified
protocol regardless of the potential desires of the user.
Protocol-relative URLs
It is possible to construct valid URLs without specifying a
protocol which are called protocol-relative links (PRL) or
protocol-relative URLs. Using PRLs on a page permits the
viewer of the page to visit new pages using whichever
protocol was used to obtain the page containing the link.
This supports continuing to use whichever protocol the
viewer has chosen to use for obtaining the current page
when accessing new pages.
An example of a PRL is //en.wikipedia.org/wiki/Main_Page
which is created by removing the protocol prefix.
Internationalized URL
Internet users are distributed throughout the world using a
wide variety of languages and alphabets. Users expect to be
able to create URLs in their own local alphabets.
An internationalized resource identifier (IRI) is a form of URL
that includes Unicode characters. All modern browsers
support IRIs. The parts of the URL requiring special
treatment for different alphabets are the domain name and
path.
The domain name in the IRI is known as an internationalized
domain name (IDN). Web and Internet software
automatically convert the domain name into punycode
usable by the Domain Name System.
For example, the Chinese web site http://見.香港 becomes
the following for DNS lookup. xn-- indicates the character
was not originally ASCII.
http://xn--nw2a.xn--j6w193g/
The URL path name can also be specified by the user in the
local alphabet. If not already encoded, it is converted to
Unicode, and any characters not part of the basic URL
character set are converted to English letters using percent-
encoding.
For example, the following Japanese Web page
http://domainname/引き割り.html becomes
http://domainname/%E5%BC%95%E3%81%8D
%E5%89%B2%E3%82%8A.html. The target computer
decodes the address and displays the page.

Chapter 5 - Data and Process Modeling PDF
0% (1)
Chapter 5 - Data and Process Modeling PDF
45 pages
The Components of A URL
100% (2)
The Components of A URL
3 pages
What is a URL_ - Learn Web Development _ MDNPDF_221229_142527
No ratings yet
What is a URL_ - Learn Web Development _ MDNPDF_221229_142527
9 pages
Internet Vs World Wide Web
No ratings yet
Internet Vs World Wide Web
10 pages
HTML Browser URL Address Bar Hyperlinks: Website Search Engine
No ratings yet
HTML Browser URL Address Bar Hyperlinks: Website Search Engine
4 pages
What Is A URL
No ratings yet
What Is A URL
4 pages
Everything About URL
No ratings yet
Everything About URL
3 pages
en-USdocsLearnCommon Questionsweb Mechanicswhat Is A URL
No ratings yet
en-USdocsLearnCommon Questionsweb Mechanicswhat Is A URL
8 pages
Internet Protocols Internet Protocols: ICT Grade 9
No ratings yet
Internet Protocols Internet Protocols: ICT Grade 9
2 pages
URL in HTML
No ratings yet
URL in HTML
4 pages
hmtl urls
No ratings yet
hmtl urls
14 pages
Unit 1
No ratings yet
Unit 1
48 pages
URL
No ratings yet
URL
4 pages
Url
No ratings yet
Url
25 pages
URLs
No ratings yet
URLs
5 pages
About Urls
No ratings yet
About Urls
8 pages
What is the URL
No ratings yet
What is the URL
7 pages
BCA SEE411 Lecture Notes V1
No ratings yet
BCA SEE411 Lecture Notes V1
10 pages
Web Computing Basics
No ratings yet
Web Computing Basics
40 pages
A Url
No ratings yet
A Url
1 page
01. Web Requests
No ratings yet
01. Web Requests
37 pages
Madhu
No ratings yet
Madhu
11 pages
Fundamental of Internet
No ratings yet
Fundamental of Internet
14 pages
CSI225 Internet Computing: About The Web
No ratings yet
CSI225 Internet Computing: About The Web
6 pages
URL (Uniform Resource Locator) Resource Locator.: - URL Is Also Sometimes Called As Universal
100% (1)
URL (Uniform Resource Locator) Resource Locator.: - URL Is Also Sometimes Called As Universal
8 pages
6-Empowerment-Technologies
No ratings yet
6-Empowerment-Technologies
1 page
Web Application Basics
No ratings yet
Web Application Basics
22 pages
domain name and URL
No ratings yet
domain name and URL
4 pages
Uniform Resource Locators (Urls) : Web Technologies
No ratings yet
Uniform Resource Locators (Urls) : Web Technologies
17 pages
Web Services
No ratings yet
Web Services
3 pages
Project and Seminar Lab Services of WWW: Submitted by Ashik Shukoor V Jos Rapheal
No ratings yet
Project and Seminar Lab Services of WWW: Submitted by Ashik Shukoor V Jos Rapheal
5 pages
Web Programming Complete Notes
No ratings yet
Web Programming Complete Notes
70 pages
Unit I: Introduction To Web Development
No ratings yet
Unit I: Introduction To Web Development
28 pages
Internet and Web Technology: Module 1: Web Development Introduction (URL - Uniform Resource Locator)
No ratings yet
Internet and Web Technology: Module 1: Web Development Introduction (URL - Uniform Resource Locator)
18 pages
Lecture 1 URL, TLD and WEB 1.0 and 2.0
No ratings yet
Lecture 1 URL, TLD and WEB 1.0 and 2.0
15 pages
unit-12
No ratings yet
unit-12
41 pages
HTTP Fundamentals For API Testing
No ratings yet
HTTP Fundamentals For API Testing
5 pages
URL Vs URI: Most Important Differences You Must Know What Is The URL?
No ratings yet
URL Vs URI: Most Important Differences You Must Know What Is The URL?
7 pages
2. WEB PROTOCOL
No ratings yet
2. WEB PROTOCOL
24 pages
A Practical Guide To Writing Clients and Servers: Go To Table of Contents Go To Footnotes Go To Other Tutorials
No ratings yet
A Practical Guide To Writing Clients and Servers: Go To Table of Contents Go To Footnotes Go To Other Tutorials
18 pages
Web Programming
No ratings yet
Web Programming
36 pages
Network 6
No ratings yet
Network 6
17 pages
WP Exam Capsules Module1
No ratings yet
WP Exam Capsules Module1
11 pages
2
No ratings yet
2
13 pages
URL and Parts of URL
No ratings yet
URL and Parts of URL
5 pages
HTTP
No ratings yet
HTTP
30 pages
IP UNIT-2 NOTES
No ratings yet
IP UNIT-2 NOTES
57 pages
CN_N_4
No ratings yet
CN_N_4
7 pages
Lab1
No ratings yet
Lab1
6 pages
about_url
No ratings yet
about_url
3 pages
Done By: Mohd Syukri Sakinah Noor ND/CST/33
No ratings yet
Done By: Mohd Syukri Sakinah Noor ND/CST/33
18 pages
Web Programming (MP3)
No ratings yet
Web Programming (MP3)
15 pages
World-Wide-Web
No ratings yet
World-Wide-Web
21 pages
HTML Url Encoding
No ratings yet
HTML Url Encoding
2 pages
Protocolo HTTP
No ratings yet
Protocolo HTTP
6 pages
Web System & Development Assignment #2
No ratings yet
Web System & Development Assignment #2
1 page
Curl Tutorial
No ratings yet
Curl Tutorial
15 pages
Mental Health
No ratings yet
Mental Health
11 pages
Parts-of-URL
No ratings yet
Parts-of-URL
14 pages
Domain Name System
No ratings yet
Domain Name System
8 pages
Evaluation of Some SMTP Testing, Email Verification, Header Analysis, SSL Checkers, Email Delivery, Email Forwarding and WordPress Email Tools
From Everand
Evaluation of Some SMTP Testing, Email Verification, Header Analysis, SSL Checkers, Email Delivery, Email Forwarding and WordPress Email Tools
Dr. Hidaia Mahmood Alassoulii
No ratings yet
Cruz Vs - Denr Sec
No ratings yet
Cruz Vs - Denr Sec
5 pages
Chavez Vs - PEA Digest
No ratings yet
Chavez Vs - PEA Digest
40 pages
Class Standing: Christ The King Collegecalbayog Citycomputation Sheet
No ratings yet
Class Standing: Christ The King Collegecalbayog Citycomputation Sheet
9 pages
G.R. No. 163766 June 22, 2006 Republic of The Philippines, Petitioner, CANDY MAKER, INC., As Represented by Its President, ONG YEE SEE, Respondent
No ratings yet
G.R. No. 163766 June 22, 2006 Republic of The Philippines, Petitioner, CANDY MAKER, INC., As Represented by Its President, ONG YEE SEE, Respondent
9 pages
House of Representatives
No ratings yet
House of Representatives
2 pages
Chapter-1 - Digital Systems and Binary Numbers PDF
0% (1)
Chapter-1 - Digital Systems and Binary Numbers PDF
102 pages
Module 5 - Reading4 - InformationPrivacy
No ratings yet
Module 5 - Reading4 - InformationPrivacy
10 pages
Module 5 - Reading6 - StructuredProgramming
No ratings yet
Module 5 - Reading6 - StructuredProgramming
11 pages
Module 5 - Reading5 - SystemDevelopment
No ratings yet
Module 5 - Reading5 - SystemDevelopment
13 pages
Module 5 - Reading7 - SoftwareDevelopmentProcess
No ratings yet
Module 5 - Reading7 - SoftwareDevelopmentProcess
15 pages
Module 5 - Reading1 - SystemDevelopment
No ratings yet
Module 5 - Reading1 - SystemDevelopment
26 pages
Module 6 - Reading1 - NetworksandTelecommunications
No ratings yet
Module 6 - Reading1 - NetworksandTelecommunications
8 pages
Module 6 - Reading5 - InternetSecurity
No ratings yet
Module 6 - Reading5 - InternetSecurity
7 pages
Module 6 - Reading2 - SecurityandSocialIssues
No ratings yet
Module 6 - Reading2 - SecurityandSocialIssues
8 pages
Module 4 - Reading6 - Web - Browser
No ratings yet
Module 4 - Reading6 - Web - Browser
9 pages
Module 6 - Reading3 - TelecommunicationNetwork
No ratings yet
Module 6 - Reading3 - TelecommunicationNetwork
5 pages
Module 4 - Reading2 - E-Commere
No ratings yet
Module 4 - Reading2 - E-Commere
10 pages
Module 4 - Reading9 - InternetSecurity
No ratings yet
Module 4 - Reading9 - InternetSecurity
11 pages
Module 5
No ratings yet
Module 5
77 pages
NIT - NIMCET - 2008 Information Brochure
No ratings yet
NIT - NIMCET - 2008 Information Brochure
10 pages
DESTINY 6100: Installation Instructions
No ratings yet
DESTINY 6100: Installation Instructions
84 pages
Pelco Protocol V3.0
No ratings yet
Pelco Protocol V3.0
17 pages
CE423 Unit 3b EPANET Software Exercise Activity
No ratings yet
CE423 Unit 3b EPANET Software Exercise Activity
15 pages
SSH Cadangan 21 Peb (Sfile
No ratings yet
SSH Cadangan 21 Peb (Sfile
3 pages
Uuuu U U U U: Registers (16-Bit)
No ratings yet
Uuuu U U U U: Registers (16-Bit)
3 pages
AVID S6L
No ratings yet
AVID S6L
6 pages
EIIT
No ratings yet
EIIT
3 pages
RMI
No ratings yet
RMI
3 pages
Exercise 1
No ratings yet
Exercise 1
1 page
Volvo EC140C L (EC140CL) Excavator Service Repair Manual Instant Download
No ratings yet
Volvo EC140C L (EC140CL) Excavator Service Repair Manual Instant Download
22 pages
Ecm Gain Setting
No ratings yet
Ecm Gain Setting
2 pages
Aquilion Lightning 80 Aice
No ratings yet
Aquilion Lightning 80 Aice
15 pages
Cambridge International AS & A Level: Computer Science 9618/41
No ratings yet
Cambridge International AS & A Level: Computer Science 9618/41
12 pages
Big Data (6CS030) Individual Assignment
No ratings yet
Big Data (6CS030) Individual Assignment
9 pages
Blockchain-Based Service Network (BSN) Introductory White Paper
100% (2)
Blockchain-Based Service Network (BSN) Introductory White Paper
20 pages
Rayabarapu Sai Krishna 23 Jan MNG
No ratings yet
Rayabarapu Sai Krishna 23 Jan MNG
5 pages
AR Customer & Invoice Interface: AR - INTF - 001: Application Module Name
No ratings yet
AR Customer & Invoice Interface: AR - INTF - 001: Application Module Name
20 pages
LT Handy2000Cordless en
0% (1)
LT Handy2000Cordless en
2 pages
Silo - Tips - Guide To Snare For Windows v42
No ratings yet
Silo - Tips - Guide To Snare For Windows v42
48 pages
Creating Classical Oil Portrait:Cesar Santos
No ratings yet
Creating Classical Oil Portrait:Cesar Santos
3 pages
Faizan Faisal CV's
No ratings yet
Faizan Faisal CV's
2 pages
Week 5 Content Analysis Grounded Theory
No ratings yet
Week 5 Content Analysis Grounded Theory
4 pages
CG - MANUAL (18 Scheme) bmk2021
No ratings yet
CG - MANUAL (18 Scheme) bmk2021
51 pages
Infineon-AURIX TC3xx System Architecture-Training-v01 00-EN
No ratings yet
Infineon-AURIX TC3xx System Architecture-Training-v01 00-EN
9 pages
BSBCRT412 Topic 1
No ratings yet
BSBCRT412 Topic 1
17 pages
RRL References
No ratings yet
RRL References
4 pages
IManager U2000 V200R015C60 Single-Server System Software Installation and Commissioning Gui
100% (1)
IManager U2000 V200R015C60 Single-Server System Software Installation and Commissioning Gui
415 pages