From fd484fe4318070fb039cb0d00809a1a227ecaf43 Mon Sep 17 00:00:00 2001 From: tejaswini Date: Mon, 6 Oct 2025 06:40:13 +0000 Subject: [PATCH] Day_001_Setup_and_Data_Ingestion.md MIME-Version: 1.0 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit Day 1 — Setup and Data Ingestion --- Day_001_Setup_and_Data_Ingestion.md | 57 +++++++++++++++++++++++++++++ 1 file changed, 57 insertions(+) create mode 100644 Day_001_Setup_and_Data_Ingestion.md diff --git a/Day_001_Setup_and_Data_Ingestion.md b/Day_001_Setup_and_Data_Ingestion.md new file mode 100644 index 0000000..2982446 --- /dev/null +++ b/Day_001_Setup_and_Data_Ingestion.md @@ -0,0 +1,57 @@ +# 📅 Day 1 — Setup and Data Ingestion + +## 🎯 Goal +Establish the foundation for the project. +The primary objective is to create an **n8n workflow** that successfully fetches raw **sales** and **product review data** from their respective sources. + +--- + +## 🧩 Tasks + +### 1. Environment Setup +- Set up your **n8n instance**: + - Options: n8n Cloud, Docker, or local installation. + - Reference: [https://docs.n8n.io/hosting/](https://docs.n8n.io/hosting/) +- Create a **Python environment**. + - Install necessary libraries: + ```bash + pip install pandas nltk + ``` + +--- + +### 2. Create a New n8n Workflow +- Start with a **Manual Trigger** node (`Start`). +- This will later be replaced with an automated schedule on **Day 5**. + +--- + +### 3. Fetch Sales Data +- Add an **HTTP Request** or **Database Node (PostgreSQL/MySQL)**. +- Connect to your e-commerce API endpoint (e.g., Shopify → `/orders.json`). +- Set up credentials (API Key, OAuth2, etc.). +- Test and verify that recent order data (e.g., order_id, product_id, price, quantity) is fetched successfully. + +--- + +### 4. Fetch Product Reviews +- Add another **HTTP Request** node. +- Configure it to pull recent **product reviews** (e.g., review_text, rating). +- Test independently to ensure successful data retrieval. + +--- + +### 5. Combine Data Streams +- Add a **Merge Node**. +- Connect both sales and review nodes. +- Set **Mode** → `Combine`. +- This ensures both data sets are merged and available for the next phase. + +--- + +## ✅ Deliverable +A manually triggered **n8n workflow** that: +- Pulls raw data from **two sources** (sales + reviews). +- Combines them using a **Merge node**. +- Outputs unified JSON data ready for analysis. +