A practical, India-focused roadmap to become a Data Engineer. Covers core skills (Python, SQL), ETL/ELT, data modelling, data lakes and warehouses, batch and streaming processing (Spark, Kafka), orchestration (Airflow), cloud services (AWS/GCP/Azure), observability, and production best practices. Free resources come first; paid courses are optional and always listed with free alternatives.
Gain a strong foundation in Python, Linux, SQL, basic data modelling, and version control.
Focus on scripting, file I/O, working with CSV/JSON, pandas for ETL prototyping, and packaging scripts.
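As a rough illustration of the CSV/JSON scripting described above, here is a minimal extract-transform-load sketch. It uses only the standard library so it runs anywhere; the same steps could be prototyped with pandas. The sample data, column names, and function names are all hypothetical.

```python
import csv
import io
import json

# Hypothetical raw input; in practice this would be read from a file on disk.
RAW_CSV = """order_id,city,amount
1,Pune,250.50
2,Mumbai,
3,Pune,100
"""


def extract(text):
    """Parse CSV text into a list of dict rows (all values are strings)."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Drop rows with a missing amount and cast fields to proper types."""
    cleaned = []
    for row in rows:
        if not row["amount"]:
            continue  # skip incomplete records
        cleaned.append(
            {
                "order_id": int(row["order_id"]),
                "city": row["city"],
                "amount": float(row["amount"]),
            }
        )
    return cleaned


def load(rows):
    """Serialize the cleaned rows as JSON Lines (one object per line)."""
    return "\n".join(json.dumps(r) for r in rows)


if __name__ == "__main__":
    print(load(transform(extract(RAW_CSV))))
```

Keeping extract, transform, and load as separate functions makes each step easy to test on its own and to package as a script later.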
Strong SQL skills: joins, window functions, subqueries, indexes, transactions, and writing optimized queries.
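To make a few of these SQL topics concrete, the sketch below uses Python's built-in `sqlite3` module with an in-memory database. It assumes an SQLite build with window-function support (3.25 or newer). The `orders` table, its columns, and the sample rows are made up for illustration.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a small, hypothetical orders table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE orders (order_id INTEGER PRIMARY KEY, city TEXT, amount REAL)"
    )
    # An index on the partitioning column, as one would add on a real table.
    conn.execute("CREATE INDEX idx_orders_city ON orders (city)")
    # 'with conn' wraps the inserts in a transaction: commit on success,
    # rollback if an exception is raised.
    with conn:
        conn.executemany(
            "INSERT INTO orders VALUES (?, ?, ?)",
            [
                (1, "Pune", 250.5),
                (2, "Mumbai", 900.0),
                (3, "Pune", 100.0),
                (4, "Mumbai", 120.0),
            ],
        )
    return conn


def top_order_per_city(conn):
    """Return the highest-value order in each city using a window function."""
    query = """
        SELECT city, order_id, amount
        FROM (
            SELECT city, order_id, amount,
                   ROW_NUMBER() OVER (
                       PARTITION BY city ORDER BY amount DESC
                   ) AS rn
            FROM orders
        )
        WHERE rn = 1
        ORDER BY city
    """
    return conn.execute(query).fetchall()


if __name__ == "__main__":
    print(top_order_per_city(build_demo_db()))
```

The subquery ranks orders within each city, and the outer query keeps only the top-ranked row. This "top N per group" pattern is a common use of window functions.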
Comfort with shell scripting, cron, file permissions, and basic networking.
Git workflows, branching, PRs, code reviews and CI basics.