Browser you have is obsolate. Please, download the modern Firefox, Chrome, Opera or Yandex browser for comfort surfing!
  
World Software Catalog
Internet catalog of free and paid applications of the World
  
 RU  EN 
Software search
Description language 1Description language 2Description language 3Description language 4
PlatformLicenseASP member
ClassSpecificCategory
NamePublisher/developer
KeywordsDescription
      
Webscraper 4.15.3
Company: PeacockMedia
Country: United Kingdom, Leicestershire, Ashby de la Zouch
ASP member: Yes
Company Web site: http://peacockmedia.software
Site of program: http://peacockmedia.software/mac/webscraper/
Application info: https://peacockmedia.software/mac/webscraper/
Video: https://www.youtube.com/watch?v=JWuXsLen2YE

Author: Shiela Dixon

Sales email: scrutiny.pad@submitpad.org
Support email: scrutiny.pad@submitpad.org

License type: Shareware
Class: Network & Internet::Search/Lookup Tools
Specific:
Categories: Internet :: Website Management, Internet :: Miscellaneous, Utilities :: Miscellaneous
Platform: OS X
OS: Mac OS X,
System requirements: MacOS 10.13 or higher, Intel or Apple Silicon
Language: English
Limitations: 30 day trial period, during which output file is limited to 5 rows for evaluation purposes

  
Keywords: scraper, website, search, extract, extraction, spider, crawler, scanner, mining, data

WebScraper uses the Integrity v6 Engine to quickly scan a website, and can output the data (currently) as csv or json. The output can include various meta data, the entire content of each page (as text, html or markdown) and can extract parts of the pages (currently a named class, id or itemprop of divs, spans, dd's or p's).

Webscraper is new. Please use it for free and please get in touch with any requests, bug reports or observations.

Easy to scan a site - just enter the starting url and press Go
Easy to export - checkboxes for the columns you want
Plenty of options / configuration
Configuration of various limits on the crawl and the output file size
Report Malware



 0    
8.17 MB

~25.00$
~23.13€
DateVersionStatusRelease history
12 Oct 20214.15.3Major UpdateAdds setting 'Legacy webview'. The new default = use the up-to-date WebKit webview for rendering, however, the legacy version may work better in some cases
The setting 'Attempt authentication' has been relabeled 'handle cookies'
13 Aug 20214.15.2Major UpdateSmall enhancement concerning downloading of images where the 'single folder' option is chosen.
Adds timeout control under Advanced Scan Settings.
Adds option to 'render page / run js' before parsing it for links
Fixes a problem preventing scanning of a list of urls
now Universal Binary Intel/M1
14 Jan 20214.14.4Minor UpdateFixes a problem preventing scanning of a list of urls
03 Dec 20204.14.3Major UpdateUpdates the selectable user-agent strings and adds more
Changes default setting for treating http:// links on the same domain (when starting with an https:// url). Now treats them as internal
Fixes a problem with the plain text content option
Inherits some general updates in the crawling engine
31 Aug 20204.14.1Major UpdateAdds option to recreate directory structure when downloading pdfs or images to a local folder
Fixes crash where a regex is found on the page but the collecting part is empty string.
23 Aug 20204.13.0Major UpdateImprovements to crawling engine, particularly with regard to image discovery; now finds image urls within inline styles
30 Jul 20204.12.1Major UpdateAdds option to download and save pdf files to a folder as it scans
Adds support for charset=GBK, charset=koi8-r, charset=euc-kr and some other Latin and non-Latin character encodings.
16 Mar 20204.11.0Major UpdateAdds option in simple setup and complex setup for scraping email addresses.
Adds field in Preferences for editing the regular expression that is used when scraping email addresses.
11 Dec 20194.10.2Major UpdateCan use the ProxyCrawl service
Adds option to strip html markup from results of class/id or regex extraction
Adds td to the list of tags which are searched when you specify a class or id
Many other fixes and enhancements
Distribution permissions: The dmg of WebScraper can be freely distributed over the internet in an unchanged form, no repackaging.

Ratio:

Back  Top

 
  0  0
New on site
March 2024
   Su   Mo   Tu   We   Th   Fr   Sa   
             1   2   
   3   4   5   6   7   8   9   
   10   11   12   13   14   15   16   
   17   18   19   20   21   22   23   
   24   25   26   27   28   29   30   
   31               
 29 March 2024 year, Friday 
User
Autorization
e-mail:

password:


Register
RSS-feed
RSS-лента    Valid RSS
Online
Guests: 6
Users: 0
Bots: 74
Total users: 52
Banners

Copyright © 2020-2024 MaaSoftware OOO