Return to Index

[download | view]

Hockliffe Project (hock)

Description

The Hockliffe Project has been designed to promote the study of early British children's literature. It will provide internet access to the full texts of the Hockliffe Collection of Early Children's Books, owned by De Montfort University, and will accompany this archive with contextualising documents and research. The aim is to work towards a reevaluation of children's literature in its own infancy, and to let these rich and varied books speak for themselves.

It is unclear as to whether or not their mechanically reproduced page images are eligible for copyright protection.

Formats

The hock module supports the following download formats:

-f value

Description

image

Pages are returned as one JPEG image per page.

ID

This one is easy. The ID is a four-digit catalog number (####). To obtain the ID, just go to the project and browse to the book you want. The catalog number is in the far left column on the browse page. Note that only books marked as having images can be downloaded.

History

Module Printout

# hock : Hockliffe Project#
# This module will allow you to download page images from
# the Hockliffe Project site. See the documentation
# for additional details


$module_default_format = "image";

%module_formats = (
  'image'  => 'jpg',
  'jpg'    => 'jpg',
  'jpeg'   => 'jpg',
);


$module_format = module_check_format($module_default_format,keys(%module_formats));
$config{'ext'} = $module_formats{$module_format};


$module_url{'plistbase'} = "http://www.cts.dmu.ac.uk/hockliffe/$idno/";
$module_url{'imagebase'} = $module_url{'plistbase'};

# Get the page listing (so we can get the high page #)
print_v("Getting page listing: ".$module_url{'plistbase'});

# Get the introductory page (for the "Click here" link)
$res = $ua->get($module_url{'plistbase'});

if ( $res->is_error ) {
  print STDERR "Could not get page listing\n".$res->status_line;
  exit();
}

$res->content =~ m!(\d+).jpg</A>[^<]+</PRE>!s;
$module_available_high = $1;
$module_available_high =~ s/^${idno}*//;

# Set the limit if a limit is set
$module_actual_high = module_set_limit($module_available_high);


foreach $module_i ( $config{'start'} .. $module_actual_high ) {
  $i = sprintf("%03d",$module_i);
  $module_image_url = $module_url{'imagebase'}.$idno.$i.".".$config{'ext'};
  print_v("Page URL: $module_image_url");
  push(@urls,$module_image_url);
}

Valid XHTML 1.0! Valid CSS! PDA Friendly