Files
Data-Analytics/Day_001_Setup_and_Data_Ingestion.md
tejaswini 90435d8ab7 Day 1 — Setup and Data Ingestion_updated
Day 1 — Setup and Data Ingestion_updated
2025-10-09 06:46:19 +00:00

56 lines
1.4 KiB
Markdown
Raw Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

# 📅 Day 1 — Setup and Data Ingestion
## 🎯 Goal
Establish the foundation of the project.
Create an **n8n workflow** that successfully fetches raw sales and product review data from their respective sources.
---
## 🧩 Tasks
### 1. Environment Setup
- Get your **n8n instance** running (n8n Cloud, Docker, or local install).
- Set up a **Python environment**:
```bash
pip install pandas nltk
2. Create a New n8n Workflow
Start with a Manual Trigger node (Start).
This will be replaced with an automated schedule on Day 5.
3. Fetch Sales Data
Add an HTTP Request node (or Database node like PostgreSQL).
Connect to your e-commerce platforms API (e.g., Shopify /orders.json).
Set up authentication (API Key, OAuth2, etc.).
Test the node to ensure it pulls recent orders.
4. Fetch Product Reviews
Add another HTTP Request node.
Connect to your website API or third-party service to fetch reviews.
Test independently to ensure data retrieval.
5. Combine Data Streams
Add a Merge node.
Connect Sales Data and Review Data nodes.
Set Mode → Combine.
This ensures both data sets are available for the next step.
✅ Deliverable
A manually triggered n8n workflow that:
Pulls raw data from two sources (sales + reviews).
Merges the data into a single workflow run.
💡 Solution
Workflow successfully merges sales and review data.
Outputs a combined JSON object, ready for Day 2 Python processing.