Spaces:
Sleeping
Sleeping
| # `SERPent` | |
| ## SERP results scrapping | |
| SERPent exposes an unified API to query SERP (Search Engine Result Pages) for a few common search engines, namely: | |
| - DuckDuckGo | |
| - Brave | |
| - Bing | |
| - Google Patents | |
| - arXiv | |
| The application uses the `playwright` library to control a headless web browser, to simulate normal user activity, to fool the anti-bot measures often present on those sites. See the `/serp/` endpoints for search results scrapping. | |
| ## Website sources scrapping | |
| SERPent also exposes a few endpoints to scrap the contents of certain sources (patents, scholar). See the `/scrap/` endpoints for supported website sources scrapping. | |