|
|
Regexp::Common::URI::http -- Returns a pattern for HTTP URIs.
use Regexp::Common qw /URI/;
while (<>) { /$RE{URI}{HTTP}/ and print "Contains an HTTP URI.\n"; }
Provides a regex for an HTTP URI as defined by RFC 2396 (generic syntax) and RFC 2616 (HTTP).
If -scheme => P
is specified the pattern P is used as the scheme.
By default P is qr/http/
. https
and https?
are reasonable
alternatives.
The syntax for an HTTP URI is:
"http:" "//" host [ ":" port ] [ "/" path [ "?" query ]]
Under {-keep}
, the following are returned:
The entire URI.
The scheme.
The host (name or address).
The port (if any).
The absolute path, including the query and leading slash.
The absolute path, including the query, without the leading slash.
The absolute path, without the query or leading slash.
The query, without the question mark.
Berners-Lee, Tim, Fielding, R., and Masinter, L.: Uniform Resource Identifiers (URI): Generic Syntax. August 1998.
Fielding, R., Gettys, J., Mogul, J., Frystyk, H., Masinter, L., Leach, P. and Berners-Lee, Tim: Hypertext Transfer Protocol -- HTTP/1.1. June 1999.
$Log: http.pm,v $ Revision 2.101 2004/06/09 21:42:48 abigail POD nits
Revision 2.100 2003/02/10 21:06:41 abigail http URI
the Regexp::Common::URI manpage for other supported URIs.
Damian Conway (damian@conway.org)
This package is maintained by Abigail (regexp-common@abigail.nl).
Bound to be plenty.
Copyright (c) 2001 - 2003, Damian Conway. All Rights Reserved. This module is free software. It may be used, redistributed and/or modified under the terms of the Perl Artistic License (see http://www.perl.com/perl/misc/Artistic.html)