This simple code should be able to dig out all file names from inside the A HREF tags where the file name consists of letters, numbers, a few special characters, spaces, file path forward slashes and periods; and ends with an extension of 2-4 characters. Each entry will be downloaded, however, take note that the A HREF data will contain a relative path to the item, including the directory structure. The webclient downloadfile method's second parameter wants a path name, including file name, for the destination. If the full path doesn't exist, the file may just get put in the current directory.
$wc = new-object net.webclient
$sitename = "http://somesite/somedirectory"
$weblisting = $wc.downloadstring($sitename)
$items = select-string '"[a-zA-Z0-9/._-() ]*\.[a-zA-Z0-9]{2,4}"' `
-input $weblisting -allmatches|
foreach {$_.matches.value.replace('"','')}
foreach ($item in $items) {
$wc.downloadfile($sitename + $item, ".\" + $item)
}
No comments:
Post a Comment