Octoparse is an easy-to-use, no-code tool that helps you fetch data from any website.
You can use this scraping tool on many websites, such as Amazon, eBay, AliExpress, and Priceline, to collect data including prices, reviews, and comments. Users don't need to know how to code to scrape data, but they do need to learn how to configure their tasks. Extracting hundreds of records or more can be completed within seconds.
In Octoparse, you create a task to build a scraper.
The Gherkin scenarios below describe how Octoparse extracts data.
1. Scenario: Create a task from a pre-built template
Given: Two options, Task Template and Advanced Mode
When: Click on Task Template
Then: Pre-built tasks for Amazon, Facebook, and Twitter should be displayed
2. Scenario: Create a task using Advanced Mode
Given: Two options, Task Template and Advanced Mode
When: Click on Advanced Mode
Then: A new task should open with an "Enter URL manually" text box
3. Scenario: Enter the URL to extract data from
Given: The new task page is open
When: Enter the URL manually in the URL text box
Then: The URL should be accepted
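Octoparse does not document here how it decides that a URL is acceptable, so the following is only a minimal sketch of what such a check might look like. The function name `looks_like_web_url` and the rule itself (an http or https scheme plus a host name) are assumptions made for illustration, not Octoparse behavior.

```python
from urllib.parse import urlparse


def looks_like_web_url(text: str) -> bool:
    """Return True when text has an http(s) scheme and a host name.

    This is an assumed acceptance rule, not the one Octoparse applies.
    """
    parts = urlparse(text.strip())
    return parts.scheme in ("http", "https") and bool(parts.netloc)
```

With this rule, the search URL used in the detection scenarios of this article would pass, while free text such as a bare search phrase would be rejected.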
4. Scenario: Save the task
Given: A new task with the URL entered
When: Click on the Save button
Then: The task should be saved and the data scraping process should proceed
5. Scenario: Switch to Workflow Mode
Given: The saved task is open
When: Click on Workflow Mode
Then: The task should be displayed in Workflow Mode
6. Scenario: Create a pagination loop
Given: The task is open in Workflow Mode
When: Scroll down and click on the Next button
Then: A number of actions should be provided under the Action Tips menu
7. Scenario: Create pagination for the next page
Given: The Action Tips menu is displayed
When: Click on "Loop click next page"
Then: The workflow should be updated with the pagination loop
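For readers who think in code, the pagination loop built in scenarios 6 and 7 can be pictured roughly as follows. This is a conceptual sketch only: Octoparse builds the loop through its interface, and the in-memory `PAGES` dictionary, the `collect_items` function, and the `max_pages` safety limit are all hypothetical names introduced here.

```python
from typing import Dict, List, Optional

# Hypothetical in-memory "site": each page lists its item names and the
# key of the page reached by clicking Next (None on the last page).
PAGES: Dict[str, dict] = {
    "page-1": {"items": ["Item A", "Item B"], "next": "page-2"},
    "page-2": {"items": ["Item C"], "next": "page-3"},
    "page-3": {"items": ["Item D", "Item E"], "next": None},
}


def collect_items(start: str, max_pages: int = 100) -> List[str]:
    """Follow the 'next' links from start, gathering items from each page."""
    collected: List[str] = []
    current: Optional[str] = start
    visited = 0
    while current is not None and visited < max_pages:
        page = PAGES[current]
        collected.extend(page["items"])  # work done on the current page
        current = page["next"]           # stands in for clicking Next
        visited += 1
    return collected
```

The loop stops when there is no Next page left or when the page limit is reached, which guards against a site whose Next button never disappears.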
8. Scenario: Create a loop item from an item name
Given: The task is open in Workflow Mode with the pagination loop
When: Click on the very first item name displayed on the search page
Then: The selected name should be highlighted in green and the other names should be highlighted in red
9. Scenario: Create a loop item for all item names
Given: The Action Tips menu is displayed
When: Click on Select All from the Action Tips menu
Then: All item names should be selected and highlighted in green
10. Scenario: Click on each item name
Given: The Action Tips menu is displayed
When: Click on "Loop click each element" from the Action Tips menu
Then: The workflow should be updated with the selected loop item
11. Scenario: Select Name to extract the names of all items
Given: The details page of each item is displayed
When: Click on the Name field
Then: All names should be highlighted and the Action Tips menu displayed
12. Scenario: Select Phone Number to extract the phone numbers of all items
Given: The details page of each item is displayed
When: Click on the Phone Number field
Then: All phone numbers should be highlighted and the Action Tips menu displayed
13. Scenario: Select Rating to extract the ratings of all items
Given: The details page of each item is displayed
When: Click on the Rating field
Then: All ratings should be highlighted and the Action Tips menu displayed
14. Scenario: Select Address to extract the addresses of all items
Given: The details page of each item is displayed
When: Click on the Address field
Then: All addresses should be highlighted and the Action Tips menu displayed
15. Scenario: Select Price to extract the prices of all items
Given: The details page of each item is displayed
When: Click on the Price field
Then: All prices should be highlighted and the Action Tips menu displayed
16. Scenario: Select Image to extract the images of all items
Given: The details page of each item is displayed
When: Click on the Image field
Then: All images should be highlighted and the Action Tips menu displayed
17. Scenario: Select data to extract
Given: The Action Tips menu is displayed
When: Click on Extract Data from the Action Tips menu
Then: All fields should be displayed in table format in the workflow
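The result of scenarios 11 to 17 is a table with one row per item and one column per selected field. A rough way to picture that step in code is shown below. It is a sketch under stated assumptions: the `DETAIL_PAGES` data, the `FIELDS` list, and the `extract_rows` function are hypothetical and stand in for what Octoparse reads from each details page.

```python
from typing import Dict, List

# Hypothetical details pages keyed by item name. The values stand in for
# the text a scraper would read from each selected field.
DETAIL_PAGES: Dict[str, Dict[str, str]] = {
    "Item A": {"name": "Item A", "phone": "555-0100", "rating": "4.5",
               "address": "1 Example St", "price": "$10.00", "image": "a.jpg"},
    "Item B": {"name": "Item B", "phone": "555-0101", "rating": "3.9",
               "address": "2 Example St", "price": "$12.50"},  # no image
}

# The fields selected in scenarios 11 to 16, in column order.
FIELDS: List[str] = ["name", "phone", "rating", "address", "price", "image"]


def extract_rows(item_names: List[str], fields: List[str]) -> List[Dict[str, str]]:
    """Build one table row per item, with one column per selected field."""
    rows: List[Dict[str, str]] = []
    for item in item_names:            # loop item: visit each details page
        detail = DETAIL_PAGES[item]
        # A field missing from a page becomes an empty cell, not an error.
        rows.append({field: detail.get(field, "") for field in fields})
    return rows
```

Keeping the field list separate from the loop means every row has the same columns, even when a details page lacks one of the fields.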
18. Scenario: Change field names
Given: The fields are displayed in table format
When: Enter appropriate names for the fields
Then: The field names should be saved in the workflow
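Renaming fields amounts to mapping placeholder column names to meaningful ones. A minimal sketch of that idea follows; the placeholder names `Field1`, `Field2`, and `Field3` and the `rename_fields` function are assumptions for illustration and may not match the names Octoparse generates.

```python
from typing import Dict, List


def rename_fields(rows: List[Dict[str, str]],
                  new_names: Dict[str, str]) -> List[Dict[str, str]]:
    """Return copies of rows with columns renamed according to new_names.

    Columns that are not listed in new_names keep their current name.
    """
    return [
        {new_names.get(column, column): value for column, value in row.items()}
        for row in rows
    ]


# Hypothetical rows with placeholder column names.
rows = [{"Field1": "Item A", "Field2": "$10.00", "Field3": "4.5"}]
renamed = rename_fields(rows, {"Field1": "Name", "Field2": "Price"})
```

The original rows are left unchanged, so the renaming can be repeated with a different mapping if the first set of names turns out to be unclear.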
19. Scenario: Save the task
Given: The task is open in workflow format
When: Click on the Save button
Then: The task should be built
20. Scenario: Run the task to get data
Given: The built task is open
When: Click on Run Task
Then: Two options should be displayed, Local Extraction and Cloud Extraction
21. Scenario: Run the task on a local device
Given: Two options, Local Extraction and Cloud Extraction
When: Click on Local Extraction to extract data on the local device
Then: Data should be extracted on the local device
22. Scenario: Run the task in the cloud
Given: Two options, Local Extraction and Cloud Extraction
When: Click on Cloud Extraction to extract data in the cloud
Then: Data should be extracted in the cloud and can be accessed from anywhere and on any device
23. Scenario: Detect Name
Given: Enter URL https://www.amazon.com/s?k=laptop&ref=nb_sb_noss
When: Click on 2020 Premium HP Stream 11 Laptop Computer 11.6" HD WLED Anti-Glare Intel Celeron Processor N4000 4GB RAM 32GB eMMC Office 365 Personal USB-C WiFi HDMI Win 10 + iCarp HDMI Cable
Then: Should display name 2020 Premium HP Stream 11 Laptop Computer 11.6" HD WLED Anti-Glare Intel Celeron Processor N4000 4GB RAM 32GB eMMC Office 365 Personal USB-C WiFi HDMI Win 10 + iCarp HDMI Cable
24. Scenario: Detect Price
Given: Enter URL https://www.amazon.com/s?k=laptop&ref=nb_sb_noss
When: Click on 2020 Premium HP Stream 11 Laptop Computer 11.6" HD WLED Anti-Glare Intel Celeron Processor N4000 4GB RAM 32GB eMMC Office 365 Personal USB-C WiFi HDMI Win 10 + iCarp HDMI Cable
Then: Should display price $349.00
25. Scenario: Detect Brand
Given: Enter URL https://www.amazon.com/s?k=laptop&ref=nb_sb_noss
When: Click on 2020 Premium HP Stream 11 Laptop Computer 11.6" HD WLED Anti-Glare Intel Celeron Processor N4000 4GB RAM 32GB eMMC Office 365 Personal USB-C WiFi HDMI Win 10 + iCarp HDMI Cable
Then: Should display brand HP
26. Scenario: Detect Image
Given: Enter URL https://www.amazon.com/s?k=laptop&ref=nb_sb_noss
When: Click on 2020 Premium HP Stream 11 Laptop Computer 11.6" HD WLED Anti-Glare Intel Celeron Processor N4000 4GB RAM 32GB eMMC Office 365 Personal USB-C WiFi HDMI Win 10 + iCarp HDMI Cable
Then: Should display image of laptop
27. Scenario: Detect Operating System
Given: Enter URL https://www.amazon.com/s?k=laptop&ref=nb_sb_noss
When: Click on 2020 Premium HP Stream 11 Laptop Computer 11.6" HD WLED Anti-Glare Intel Celeron Processor N4000 4GB RAM 32GB eMMC Office 365 Personal USB-C WiFi HDMI Win 10 + iCarp HDMI Cable
Then: Should display operating system Windows 10
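The detection scenarios are in effect expected-value checks on a single extracted record. One way such checks might be automated after a run is sketched below. The `extracted` record is a hypothetical stand-in for the scraper output, built from the values quoted in the scenarios, and `parse_price` and `mismatched_fields` are illustrative helper names, not part of Octoparse.

```python
from decimal import Decimal
from typing import Dict, List

PRODUCT_NAME = (
    '2020 Premium HP Stream 11 Laptop Computer 11.6" HD WLED Anti-Glare '
    "Intel Celeron Processor N4000 4GB RAM 32GB eMMC Office 365 Personal "
    "USB-C WiFi HDMI Win 10 + iCarp HDMI Cable"
)

# Hypothetical record standing in for what the task returned for this product.
extracted: Dict[str, str] = {
    "name": PRODUCT_NAME,
    "price": "$349.00",
    "operating_system": "Windows 10",
}


def parse_price(text: str) -> Decimal:
    """Turn a price string such as '$1,299.00' into a Decimal amount."""
    return Decimal(text.replace("$", "").replace(",", "").strip())


def mismatched_fields(record: Dict[str, str], expected: Dict[str, str]) -> List[str]:
    """Return the names of fields whose extracted value differs from expected."""
    return [field for field, want in expected.items() if record.get(field) != want]
```

An empty list from `mismatched_fields` means every expected value was found. Parsing the price into a `Decimal` rather than a float avoids rounding surprises when amounts are compared or summed.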