1URI::Find::Schemeless(3U)ser Contributed Perl DocumentatiUoRnI::Find::Schemeless(3)
2
3
4
6 URI::Find::Schemeless - Find schemeless URIs in arbitrary text.
7
9 require URI::Find::Schemeless;
10
11 my $finder = URI::Find::Schemeless->new(\&callback);
12
13 The rest is the same as URI::Find.
14
16 URI::Find finds absolute URIs in plain text with some weak heuristics
17 for finding schemeless URIs. This subclass is for finding things which
18 might be URIs in free text. Things like "www.foo.com" and
19 "lifes.a.bitch.if.you.aint.got.net".
20
21 The heuristics are such that it hopefully finds a minimum of false
22 positives, but there's no easy way for it know if "COMMAND.COM" refers
23 to a web site or a file.
24
25 top_level_domain_re
26
27 my $tld_re = $self->top_level_domain_re;
28
29 Returns the regex for matching top level DNS domains. The regex
30 shouldn't be anchored, it shouldn't do any capturing matches, and it
31 should make itself ignore case.
32
34 Original code by Roderick Schertler <roderick@argon.org>, adapted by
35 Michael G Schwern <schwern@pobox.com>.
36
37 Currently maintained by Roderick Schertler <roderick@argon.org>.
38
40 L<URI::Find>
41
42
43
44perl v5.32.1 2021-01-27 URI::Find::Schemeless(3)