Use Cases
- Automate the collection of web content for research purposes
- Streamline content creation by gathering information from multiple sources
- Organize scraped webpage content in Google Drive for easy access
How It Works
1. Trigger the workflow manually to start the scraping process
2. Fetch the sitemap URL to retrieve a list of website URLs
3. Filter URLs based on specific topics or keywords
4. Scrape each webpage using Jina.ai to extract titles and markdown content
5. Save the extracted content as text files in Google Drive
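For readers who want to see the same flow outside n8n, here is a minimal Python sketch. It is not the workflow's actual node configuration. It assumes a standard sitemap.org XML file, uses the public Jina Reader prefix `https://r.jina.ai/` to get markdown for a page, and writes files to a local folder as a stand-in for the Google Drive upload. The function names (`parse_sitemap`, `filter_urls`, `reader_url`, `scrape_site`) are hypothetical and chosen only for illustration.

```python
import re
import xml.etree.ElementTree as ET
from pathlib import Path
from urllib.request import Request, urlopen

# Namespace used by standard sitemap.org sitemap files.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def parse_sitemap(xml_text):
    """Return every <loc> URL found in a sitemap XML string."""
    root = ET.fromstring(xml_text)
    return [loc.text.strip() for loc in root.findall(".//sm:loc", SITEMAP_NS) if loc.text]


def filter_urls(urls, keywords):
    """Keep URLs containing at least one keyword (case-insensitive)."""
    lowered = [k.lower() for k in keywords]
    return [u for u in urls if any(k in u.lower() for k in lowered)]


def reader_url(page_url):
    """Build the Jina Reader address for a page by prefixing r.jina.ai."""
    return "https://r.jina.ai/" + page_url


def fetch_text(url, timeout=30):
    """Download a URL and decode the body as UTF-8 text."""
    req = Request(url, headers={"User-Agent": "sitemap-scraper-sketch"})
    with urlopen(req, timeout=timeout) as resp:
        return resp.read().decode("utf-8", errors="replace")


def scrape_site(sitemap_url, keywords, out_dir):
    """Fetch the sitemap, filter it, and save each page's markdown locally.

    Assumption: saving to a local folder replaces the Google Drive step.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for page in filter_urls(parse_sitemap(fetch_text(sitemap_url)), keywords):
        markdown = fetch_text(reader_url(page))
        # Derive a safe file name from the page URL.
        name = re.sub(r"[^A-Za-z0-9]+", "-", page).strip("-") + ".txt"
        (out / name).write_text(markdown, encoding="utf-8")
```

A call such as `scrape_site("https://example.com/sitemap.xml", ["agent"], "scraped")` would mirror steps 2 to 5, with the example domain and keyword as placeholders.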
Setup Steps
1. Import the workflow template into your n8n instance
2. Click on 'Test workflow' to initiate the process
3. Set the sitemap URL in the 'Set Website URL' node
4. Run the workflow to start scraping and saving content
Apps Used
Google Drive
Jina.ai
Tags
#content automation
#data extraction
#workflow management