top of page
Sanya

Gherkin Script for validating data from a Scraping tool

I Scrape Blog data from Numpy Ninja – Manually



Feature 1: Using a web scraping tool

In order to scrape blog data from https://www.numpyninja.com/

As a user

I want to use the tool to extract the required data

Scenario 1: Log on to Numpy Ninja website for data scraping

Given I am on the Scraper tool home screen

When I enter https://www.numpyninja.com/ in the URL field

And I click on start button

Then Numpy Ninja homepage https://www.numpyninja.com/ should be loaded

Scenario 2: Loading the Blog webpage with many blogs

Given I am on the Scraper tool home screen

And I click on start button

Then Numpy Ninja Blog webpage https://www.numpyninja.com/blog should be loaded

Scenario 3: Loading a particular Blog webpage

Given I am on the Scraper tool home screen

And I click on start button

Scenario 4: Capture the Title of the blog

When I click on the Title

Then show “Data Scraping – Octoparse 8 with data driven scenarios”

Scenario 5: Capture the Author of the blog

When I click on the Author

Then show “kspkalyani”

Scenario 6: Capture the Published_Time of the blog

When I click on the Published_Time

Then show “3 hours ago”

Scenario 7: Capture the Time_to_Read of the blog

When I click on the Time_to_Read

Then show “2 min read”

Scenario 8: Capture the Views of the blog

When I click on the Views

Then show “7 views”


II Scrape price of MacBook Pro 2020 from Amazon – Auto-detect mode



Feature 1: Using a web scraping tool

In order to scrape price data of MacBook Pro 2020 from www.amazon.com

As a user

I want to use the tool to extract the required data

Scenario 1: Log on to amazon website for data scraping

Given I enter www.amazon.com in the URL field

When I click on start button

Then Amazon homepage should be loaded

Scenario 2: Loading “MacBook Pro 2020” related data webpage from homepage

Given I am on Amazon homepage

When I type MacBook Pro 2020 in the search bar

And click on search button

Scenario 3: Loading “MacBook Pro 2020” related data webpage directly

When I click on start button

Scenario 4: Detect % web page data being loaded

Given I clicked start button

When the Amazon website is being loaded

Then a modal window should detect % web page data being loaded for infinite scroll

Scenario 5: Amazon website not getting loaded

Given I clicked start button

When the Amazon website is NOT being loaded

Then toggle browse button

And click Reload webpage

Scenario 6: Wrong webpage detected

Given I wrongly typed item in the search bar

When I click start button

Then I am not in the right page to extract required data

Scenario 7: Wrong webpage detected

Given I am not in the right page to extract required data

When I click on “Not the right webpage”

Then I can correct the wrongly typed item in the search bar


Feature 2: Validating the pre-selected elements by the tool

In order to validate the auto-detected data

As a user

I want to use the modal window

Scenario 1: To check auto detect is completed

Given I am on the home screen of the tool

When the Amazon website has finished loading in the built-in browser

Then modal window should show the message “Auto-detect completed”

Scenario 2: Detect the Title of the laptop

When detecting Title

Then show New Apple MacBook Pro with Apple M1 Chip (13-inch, 8GB RAM, 256GB SSD Storage) - Space Gray (Latest Model)

Scenario 3: Detect the Original Price of the laptop

When detecting Original Price

Then show $1,299.00

Scenario 4: Detect the Offer Price of the laptop

When detecting Offer Price

Then show $1,199.00

Scenario 5: Detect the blank Original Price of the laptop

When Original Price detected is blank

Then show blank field

Scenario 6: Detect the blank Offer Price of the laptop

When Offer Price detected is blank

Then show blank field

354 views

Recent Posts

See All
bottom of page