|
|
Regexp::Common::URI -- provide patterns for URIs.
use Regexp::Common qw /URI/;
while (<>) { /$RE{URI}{HTTP}/ and print "Contains an HTTP URI.\n"; }
Patterns for the following URIs are supported: fax, file, FTP, gopher, HTTP, news, NTTP, pop, prospero, tel, telnet, tv and WAIS. Each is documented in the Regexp::Common::URI::scheme, manual page, for the appropriate scheme (in lowercase), except for NNTP URIs which are found in Regexp::Common::URI::news.
$RE{URI}
Return a pattern that recognizes any of the supported URIs. With
{-keep}
, only the entire URI is returned (in $1
).
Zigmond, D. and Vickers, M: Uniform Resource Identifiers for Television Broadcasts. December 2000.
Casey, James: A FTP URL Format. November 1996.
Mockapetris, P.: DOMAIN NAMES - IMPLEMENTATION AND SPECIFICATION. November 1987.
Berners-Lee, Tim, Masinter, L., McCahill, M.: Uniform Resource Locators (URL). December 1994.
Berners-Lee, Tim, Fielding, R., and Masinter, L.: Uniform Resource Identifiers (URI): Generic Syntax. August 1998.
Fielding, R., Gettys, J., Mogul, J., Frystyk, H., Masinter, L., Leach, P. and Berners-Lee, Tim: Hypertext Transfer Protocol -- HTTP/1.1. June 1999.
Vaha-Sipila, A.: URLs for Telephone Calls. April 2000.
$Log: URI.pm,v $ Revision 2.108 2004/06/09 21:45:24 abigail POD
Revision 2.107 2003/03/25 23:20:30 abigail pop and prospero URIs
Revision 2.106 2003/03/12 22:28:57 abigail WAIS URIs
Revision 2.105 2003/02/21 14:49:41 abigail Gopher added
Revision 2.104 2003/02/11 14:10:25 abigail Changed 'nntp' to 'NNTP'
Revision 2.103 2003/02/10 21:18:07 abigail Move most of the code into separate files. One file per URI, and one file per RFC.
Revision 2.102 2003/02/07 15:24:17 abigail telnet URIs
Revision 2.101 2003/02/01 22:55:31 abigail Changed Copyright years
Revision 2.100 2003/01/21 23:19:40 abigail The whole world understands RCS/CVS version numbers, that 1.9 is an older version than 1.10. Except CPAN. Curse the idiot(s) who think that version numbers are floats (in which universe do floats have more than one decimal dot?). Everything is bumped to version 2.100 because CPAN couldn't deal with the fact one file had version 1.10.
Revision 1.11 2003/01/21 22:59:33 abigail Fixed small errors with and
Revision 1.10 2003/01/17 13:17:15 abigail Fixed '$toplabel' and '$domainlabel'; they were both subexpressions of the form: A|AB. Which passed the tests because most tests anchor the regex at the beginning and end.
Revision 1.9 2003/01/01 23:00:54 abigail TV URIs
Revision 1.8 2002/08/27 16:56:27 abigail Support for fax URIs.
Revision 1.7 2002/08/06 14:44:07 abigail Local phone numbers can have future extensions as well.
Revision 1.6 2002/08/06 13:18:03 abigail Cosmetic changes
Revision 1.5 2002/08/06 13:16:27 abigail Added $RE{URI}{tel}{nofuture}
Revision 1.4 2002/08/06 00:03:30 abigail Added $RE{URI}{tel}
Revision 1.3 2002/08/04 22:51:35 abigail Added FTP URIs.
Revision 1.2 2002/07/25 22:37:44 abigail Added 'use strict'. Added 'no_defaults' to 'use Regex::Common' to prevent loading of all defaults.
Revision 1.1 2002/07/25 19:56:07 abigail Modularizing Regexp::Common.
the Regexp::Common manpage for a general description of how to use this interface.
Damian Conway (damian@conway.org)
This package is maintained by Abigail (regexp-common@abigail.nl).
Bound to be plenty.
For a start, there are many common regexes missing. Send them in to regexp-common@abigail.nl.
Copyright (c) 2001 - 2003, Damian Conway. All Rights Reserved. This module is free software. It may be used, redistributed and/or modified under the terms of the Perl Artistic License (see http://www.perl.com/perl/misc/Artistic.html)