Files
Data-Analytics/Day_001_Setup_and_Data_Ingestion.md
tejaswini 90435d8ab7 Day 1 — Setup and Data Ingestion_updated
Day 1 — Setup and Data Ingestion_updated
2025-10-09 06:46:19 +00:00

1.4 KiB
Raw Permalink Blame History

📅 Day 1 — Setup and Data Ingestion

🎯 Goal

Establish the foundation of the project.
Create an n8n workflow that successfully fetches raw sales and product review data from their respective sources.


🧩 Tasks

1. Environment Setup

  • Get your n8n instance running (n8n Cloud, Docker, or local install).
  • Set up a Python environment:
    pip install pandas nltk
    
  1. Create a New n8n Workflow Start with a Manual Trigger node (Start).

This will be replaced with an automated schedule on Day 5.

  1. Fetch Sales Data Add an HTTP Request node (or Database node like PostgreSQL).

Connect to your e-commerce platforms API (e.g., Shopify /orders.json).

Set up authentication (API Key, OAuth2, etc.).

Test the node to ensure it pulls recent orders.

  1. Fetch Product Reviews Add another HTTP Request node.

Connect to your website API or third-party service to fetch reviews.

Test independently to ensure data retrieval.

  1. Combine Data Streams Add a Merge node.

Connect Sales Data and Review Data nodes.

Set Mode → Combine.

This ensures both data sets are available for the next step.

Deliverable A manually triggered n8n workflow that:

Pulls raw data from two sources (sales + reviews).

Merges the data into a single workflow run.

💡 Solution Workflow successfully merges sales and review data.

Outputs a combined JSON object, ready for Day 2 Python processing.