Google Hacks By Tara Calishain & Rael Dornfest The following corrections were made to the 3/04 reprint: Here's a key to the markup: [page-number]: serious technical mistake {page-number}: minor technical mistake : important language/formatting problem (page-number): language change or minor formatting problem (cover) Tara Calishain's name IS NOW spelled correctly on the spine of the book. (xii) Nine lines from bottom; innovate rather then NOW READS: innovate rather than p. xviii line 4: "uncovering documented features" NOW READS: "uncovering undocumented features" (15) Second sentence; Sometimes this is a description or a sentence or so. NOW READS: "Sometimes this is a description of the site or a selected sentence or two." (18) Description of The On-Line Medical Dictionary; The list 'biochemistry, cell biology, chemistry, medicine, molecular biology, physics, plant biology, radiobiology, science and technology' was repeated. The second instance of this listing HAS BEEN DELETED. (21) three lines from bottom: occurence NOW READS: occurrence (26) Third sentence; The link: domain does not mix. NOW READS: The link: syntax does not mix. [33] Custom Google Search Form;code was missing submit button. Following line WAS ADDED prior to the line: (38) 9 lines from top: our fathers NOW READS: our forefathers (43) 11 lines down: fast an easy NOW READS: fast and easy (48) 3 lines up from bottom; "and they're unusal enough" NOW READS: "and they're usually enough" (56) down 6 lines: miriad NOW READS: myriad (61) 8 lines up from the bottom "the visit the site itself." NOW READS: "to visit the site itself." [100] In "sub doGoogleSearch" function; The line: # Parse the XML my $results = XMLin($content); NOW READS (two lines, instead of one): # Parse the XML my $results = XMLin($content); {116} in the code 2nd to last line in the code NOW READS: print qq{"$title","$url","$size","$suffix"\n}; (124) example under "The Results"; the command: $ perl groups2csv.pl < groups.html > groups.csv NOW READS: $ perl groups2csv.pl < groups.html [126] 4th line; $output =~ s!!!g; # drop all HTML tags NOW READS: $output =~ s!<.+?>!!g; # drop all HTML tags (128) 6th paragraph: NOW READS: of any data (130) 3 lines from bottom: do it's thing NOW READS: do its thing (131) 2nd example under "Running the Hack"; $ perl phonebook2csv.pl < results.html > results.csv NOW READS: $ perl phonebook2csv.pl < results.html (134) end para 2: alotted NOW READS: allotted (140) mid-para 2: Goole Web NOW READS: Google Web (141) IN PRINT: "The 10-Result Limit" section, last sentence; "...10 at a time [Hack #1]." NOW READS: "...10 at a time [Hack #51]." (145) IN PRINT: Lines 10 and 11 of hack #51; "# Keep track of result number my $number = 0;" NOW READS: "# Keep track of result number my $number = 0;" (146) 1st text para: defacto NOW READS: de facto (152) 1st text para: Its no different NOW READS: it's no different {176} In the code; Warn "Searcing Google for $query\n" NOW READS: Warn "Searcing Google for $query\n"; {208} 3rd paragraph, second sentence: "With Userland and Manila, it's as easy as ... " NOW READS: "With Radio Userland and Manila, it's as easy as ..." (210) mid-page: incoporate NOW READS: incorporate (216) The last sentence before the figure 6-10.; Figure 6-10 illustrates the I'm Feeling Lucky Search. NOW READS: Figure 6-10 illustrates the I'm Feeling VERY Lucky Search. (225) The first paragraph in Hack #72; http://onfocus.com/Book Watch/ NOW READS: http://onfocus.com/BookWatch/ (238) 9 lines from bottom: comes and gos NOW READS: comes and goes (245) bottom line: music, and more. NOW READS: music and more. (248) fifth line: sensation, Eminem, is NOW READS: sensation Eminem is (270) 3rd Paragraph from the bottom, the sentence that starts "You can also fool Google..." "...at the sime time." NOW READS: "...at the same time." (272) 3/4 of the way down the page ($whacktwo) = ($whacktwo =~ !m

(.*)

!i); NOW READS: ($whacktwo) = ($whacktwo =~ m!

(.*)

!i); (292) bullet #15: "out weight" NOW READS: 'outweigh' (298) Tagline of section; "Solid content thoughfully prepared..." NOW READS: "Solid content thoughtfully prepared..." In the first paragraph, the same mistake HAS BEEN CORRECTED. (301) 4th para: on it's topic NOW READS: on its topic (302) Item R: content on to NOW READS: content onto