Data Scraping Tool Octoparse 8 in Gherkin Format

Octoparse is an easy-to-use, no-code tool that helps you fetch data from websites.

You can use this scraper to collect data such as prices, reviews, and comments from many websites, including Amazon, eBay, AliExpress, and Priceline. Users don't need to know how to code to scrape data, but they do need to learn how to configure their tasks. Extracting hundreds of records or more can be completed within seconds.

In Octoparse, you create a task to build a scraper.

The Gherkin scenarios below describe how Octoparse extracts data.
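
Because each scenario in this post follows the same Scenario / Given / When / Then layout, it can be read into a small data structure. The following is a minimal Python sketch of that idea and is not part of Octoparse; the `Scenario` class and the `parse_scenario` function are hypothetical names introduced only for illustration.

```python
import re
from dataclasses import dataclass

# Matches "Given: ...", "When : ..." and "Then: ..." (optional space before the colon).
STEP_PATTERN = re.compile(r"^(Given|When|Then)\s*:\s*(.+)$")


@dataclass
class Scenario:
    title: str
    given: str = ""
    when: str = ""
    then: str = ""


def parse_scenario(text: str) -> Scenario:
    """Read one scenario written in the layout used in this post."""
    scenario = Scenario(title="")
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue  # skip the blank lines between steps
        step = STEP_PATTERN.match(line)
        if step:
            # Store the step text under "given", "when" or "then".
            setattr(scenario, step.group(1).lower(), step.group(2).strip())
        elif "Scenario" in line:
            # Everything after the first colon is the scenario title.
            scenario.title = line.split(":", 1)[1].strip()
    return scenario


example = """
2. Scenario: Create Task using Advanced Mode

Given: Two options, Task Template and Advanced Mode
When : Click on Advanced Mode
Then: New task should open with the Enter URL manually text box
"""

parsed = parse_scenario(example)
print(parsed.title)
print(parsed.when)
```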


1. Scenario: Create Task from pre-built template

Given: Two options, Task Template and Advanced Mode, are available

When: Click on Task Template

Then: The pre-built tasks for Amazon, Facebook, and Twitter should be displayed



2. Scenario: Create Task using Advanced Mode

Given: Two options, Task Template and Advanced Mode, are available

When: Click on Advanced Mode

Then: A new task should open with the "Enter URL manually" text box


3. Scenario: Enter the URL to extract data from

Given: The new task page is open

When: Enter the URL manually in the URL text box

Then: The URL should be accepted
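
How Octoparse itself decides whether a URL is acceptable is not covered in this post. As a rough sketch of the kind of check that could be applied to a URL before pasting it into the text box, the Python snippet below uses the standard `urllib.parse` module; `is_acceptable_url` is a hypothetical helper, and the URL is the Amazon search URL used in the later scenarios.

```python
from urllib.parse import parse_qs, urlparse


def is_acceptable_url(url: str) -> bool:
    """Assumed rule: the URL needs an http(s) scheme and a host name."""
    parts = urlparse(url.strip())
    return parts.scheme in ("http", "https") and bool(parts.netloc)


search_url = "https://www.amazon.com/s?k=laptop&ref=nb_sb_noss"

# The query string shows what the search page was asked for.
query = parse_qs(urlparse(search_url).query)

print(is_acceptable_url(search_url))
print(query["k"])
```

A URL typed without its scheme, such as `amazon.com/s?k=laptop`, fails this check because `urlparse` finds neither a scheme nor a host in it.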


4. Scenario: Saving the task

Given: New task with URL entered

When: Click on Save button

Then: The task should be saved and the data scraping process should proceed


5. Scenario: Switch to Workflow Mode

Given: The saved task is open

When: Click on Workflow Mode

Then: The task should be displayed in Workflow Mode


6. Scenario: Create pagination loop

Given: The task is open in Workflow Mode

When: Scroll down and click on the Next button

Then: A number of actions should be offered under the Action Tips menu


7. Scenario: Create pagination for the next page

Given: The Action Tips menu is displayed

When: Click on "Loop click next page"

Then: The workflow should be updated with the pagination loop
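
Conceptually, a pagination loop repeats the same extraction on a page and then follows the Next button until there is no next page. The Python sketch below models that idea with in-memory data only; it does not show Octoparse internals, and `PAGES`, its contents, and `iterate_pages` are made up for illustration.

```python
from typing import Dict, Iterator, List

# Hypothetical stand-in for a paginated site: each page lists its items
# and names the page that the Next button leads to (None on the last page).
PAGES: Dict[str, dict] = {
    "page-1": {"items": ["Laptop A", "Laptop B"], "next": "page-2"},
    "page-2": {"items": ["Laptop C"], "next": None},
}


def iterate_pages(start: str, max_pages: int = 100) -> Iterator[List[str]]:
    """Yield the items of each page, following "next" until it runs out."""
    current = start
    visited = 0
    # max_pages guards against a site whose Next button never disappears.
    while current is not None and visited < max_pages:
        page = PAGES[current]
        yield page["items"]
        current = page["next"]
        visited += 1


all_items = [item for items in iterate_pages("page-1") for item in items]
print(all_items)
```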


8. Scenario: Create a loop item from item name

Given: The task is open in Workflow Mode with the pagination loop

When: Click on the very first item name displayed on the search page

Then: The selected name should be highlighted in green and the other names should be highlighted in red


9. Scenario: Create loop item for item name

Given: The Action Tips menu is displayed

When: Click on Select All from the Action Tips menu

Then: All item names should be highlighted in green and selected


10. Scenario: Click on each item name

Given: The Action Tips menu is displayed

When: Click on "Loop click each element" from the Action Tips menu

Then: The workflow should change to include the selected loop item


11. Scenario: Select Name to extract data of all names

Given: The details page of each item is displayed

When: Click on each Name field

Then: All names should be highlighted and the Action Tips menu should be displayed


12. Scenario: Select Phone Number to extract data of all phone numbers

Given: The details page of each item is displayed

When: Click on each Phone Number field

Then: All phone numbers should be highlighted and the Action Tips menu should be displayed


13. Scenario: Select Rating to extract data of all ratings

Given: The details page of each item is displayed

When: Click on each Rating field

Then: All ratings should be highlighted and the Action Tips menu should be displayed


14. Scenario: Select Address to extract data of all addresses

Given: The details page of each item is displayed

When: Click on each Address field

Then: All addresses should be highlighted and the Action Tips menu should be displayed


15. Scenario: Select Price to extract data of all prices

Given: The details page of each item is displayed

When: Click on each Price field

Then: All prices should be highlighted and the Action Tips menu should be displayed


16. Scenario: Select Image to extract data of all images

Given: The details page of each item is displayed

When: Click on each Image field

Then: All images should be highlighted and the Action Tips menu should be displayed



17. Scenario: Select data to extract

Given: The Action Tips menu is displayed

When: Click on Extract Data from the Action Tips menu

Then: All fields should be displayed in table format in the workflow
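
The loop-item and field-selection scenarios amount to visiting each item's details page and copying the selected fields into one table row per item. The Python sketch below illustrates that shape with placeholder data; the field list mirrors the fields named in scenarios 11 to 16, while `detail_pages`, its values, and `extract_rows` are hypothetical and do not come from Octoparse.

```python
from typing import Dict, List

# Fields selected in scenarios 11 to 16.
FIELDS = ["Name", "Phone Number", "Rating", "Address", "Price", "Image"]

# Placeholder details pages; real pages would come from the loop item.
detail_pages: List[Dict[str, str]] = [
    {
        "Name": "Example Item 1",
        "Phone Number": "555-0100",
        "Rating": "4.5",
        "Address": "1 Example Street",
        "Price": "$10.00",
        "Image": "item1.png",
        "Unselected": "not part of the table",
    },
    # A page that lacks some of the selected fields.
    {"Name": "Example Item 2", "Rating": "4.0", "Price": "$12.50"},
]


def extract_rows(pages: List[Dict[str, str]], fields: List[str]) -> List[Dict[str, str]]:
    """Build one row per page, keeping only the selected fields."""
    # Missing fields become empty cells so every row has the same columns.
    return [{field: page.get(field, "") for field in fields} for page in pages]


rows = extract_rows(detail_pages, FIELDS)
for row in rows:
    print(row)
```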


18. Scenario: Change field names

Given: The fields are displayed in table format

When: Enter appropriate names for the fields

Then: The field names should be saved in the workflow
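
Renaming fields only changes the column headers, not the extracted values. A small Python sketch of that mapping follows; the generic names `Field1` and `Field2` are an assumption about what unnamed columns might look like, and `rename_fields` is a hypothetical helper.

```python
from typing import Dict, List


def rename_fields(rows: List[Dict[str, str]], new_names: Dict[str, str]) -> List[Dict[str, str]]:
    """Return copies of the rows with column names replaced where a new name is given."""
    return [
        {new_names.get(name, name): value for name, value in row.items()}
        for row in rows
    ]


# Assumed generic column names before renaming.
raw_rows = [{"Field1": "Example Item", "Field2": "$10.00", "Rating": "4.5"}]

renamed = rename_fields(raw_rows, {"Field1": "Name", "Field2": "Price"})
print(renamed)
```

Columns without an entry in the mapping, such as `Rating` here, keep their existing name.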


19. Scenario: Saving the task

Given: The task is open in workflow format

When: Click on the Save button

Then: The task should be built


20. Scenario: Run the task to get data

Given: The already built task is open

When: Click on Run task

Then: Two options, Local Extraction and Cloud Extraction, should be displayed


21. Scenario: Run task on local device

Given: Two options, Local Extraction and Cloud Extraction, are displayed

When: Click on Local Extraction to extract data on the local device

Then: The data should be extracted on the local device


22. Scenario: Run task on Cloud

Given: Two options, Local Extraction and Cloud Extraction, are displayed

When: Click on Cloud Extraction to extract data in the cloud

Then: The data should be extracted in the cloud and be accessible from anywhere and on any device


23. Scenario: Detect Name

Given: Enter URL https://www.amazon.com/s?k=laptop&ref=nb_sb_noss

When: Click on 2020 Premium HP Stream 11 Laptop Computer 11.6" HD WLED Anti-Glare Intel Celeron Processor N4000 4GB RAM 32GB eMMC Office 365 Personal USB-C WiFi HDMI Win 10 + iCarp HDMI Cable

Then: Should display name 2020 Premium HP Stream 11 Laptop Computer 11.6" HD WLED Anti-Glare Intel Celeron Processor N4000 4GB RAM 32GB eMMC Office 365 Personal USB-C WiFi HDMI Win 10 + iCarp HDMI Cable


24. Scenario: Detect Price

Given: Enter URL https://www.amazon.com/s?k=laptop&ref=nb_sb_noss

When: Click on 2020 Premium HP Stream 11 Laptop Computer 11.6" HD WLED Anti-Glare Intel Celeron Processor N4000 4GB RAM 32GB eMMC Office 365 Personal USB-C WiFi HDMI Win 10 + iCarp HDMI Cable

Then: Should display price $349.00
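
A price scraped from a page arrives as text such as "$349.00". If the value is going to be sorted or compared after export, it usually has to be converted to a number first. The Python sketch below shows one way to do that with the standard `decimal` module; it is a post-processing step outside Octoparse, and `parse_price` is a hypothetical helper that assumes a dot as the decimal separator and commas as thousands separators.

```python
import re
from decimal import Decimal


def parse_price(text: str) -> Decimal:
    """Pull the first number out of a price string and return it as a Decimal."""
    match = re.search(r"\d[\d,]*(?:\.\d+)?", text)
    if match is None:
        raise ValueError(f"no price found in {text!r}")
    # Drop thousands separators before converting.
    return Decimal(match.group(0).replace(",", ""))


price = parse_price("$349.00")
print(price)
```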


25. Scenario: Detect Brand

Given: Enter URL https://www.amazon.com/s?k=laptop&ref=nb_sb_noss

When: Click on 2020 Premium HP Stream 11 Laptop Computer 11.6" HD WLED Anti-Glare Intel Celeron Processor N4000 4GB RAM 32GB eMMC Office 365 Personal USB-C WiFi HDMI Win 10 + iCarp HDMI Cable

Then: Should display brand HP


26. Scenario: Detect Image

Given: Enter URL https://www.amazon.com/s?k=laptop&ref=nb_sb_noss

When: Click on 2020 Premium HP Stream 11 Laptop Computer 11.6" HD WLED Anti-Glare Intel Celeron Processor N4000 4GB RAM 32GB eMMC Office 365 Personal USB-C WiFi HDMI Win 10 + iCarp HDMI Cable

Then: Should display the image of the laptop


27. Scenario: Detect Operating System

Given: Enter URL https://www.amazon.com/s?k=laptop&ref=nb_sb_noss

When: Click on 2020 Premium HP Stream 11 Laptop Computer 11.6" HD WLED Anti-Glare Intel Celeron Processor N4000 4GB RAM 32GB eMMC Office 365 Personal USB-C WiFi HDMI Win 10 + iCarp HDMI Cable

Then: Should display operating system Windows 10
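
The detection scenarios each pair a field with an expected value, so an exported record can be checked against them field by field. The Python sketch below shows one possible comparison; the expected price and operating system come from the scenarios above, while `find_mismatches` and the sample records are hypothetical.

```python
from typing import Dict, Optional, Tuple

# Expected values taken from the Detect Price and Detect Operating System scenarios.
EXPECTED = {"Price": "$349.00", "Operating System": "Windows 10"}


def find_mismatches(
    expected: Dict[str, str], actual: Dict[str, str]
) -> Dict[str, Tuple[str, Optional[str]]]:
    """Map each differing field to (expected value, actual value or None if missing)."""
    return {
        field: (wanted, actual.get(field))
        for field, wanted in expected.items()
        if actual.get(field) != wanted
    }


# Made-up records standing in for rows exported from a task run.
good_record = {"Price": "$349.00", "Operating System": "Windows 10"}
bad_record = {"Price": "$399.00"}

print(find_mismatches(EXPECTED, good_record))
print(find_mismatches(EXPECTED, bad_record))
```

An empty result means every expected field matched; otherwise the result lists what was expected next to what was actually extracted.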

