We are seeking a highly skilled and experienced Data Engineer to join our team. The successful candidate will be responsible for building, maintaining, and optimizing our ETL pipelines. You will play a key role in ensuring our data architecture supports our rapid growth and enables us to extract meaningful insights from complex data sets.
Key Responsibilities:
• Design, build, and maintain scalable and reliable ETL pipelines to support data integration from various sources.
• Develop and manage databases using BigQuery, MySQL & Pinecone, ensuring data integrity, security, and performance.
• Collaborate with cross-functional teams, to gather requirements and deliver data solutions that support business initiatives.
• Implement data warehousing solutions and data modeling practices to support advanced analytical and reporting capabilities.
• Optimize data flow and collection to improve data accuracy and value.
• Ensure compliance with data governance and data security requirements.
• Monitor and troubleshoot performance issues on the data pipelines and databases.
• Stay up-to-date with industry standards and advancements in data engineering practices and technologies.
Qualifications:
• Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field.
• Minimum of 3 years of experience in a Data Engineering role.
• Proficient in SQL and experience with database management systems, particularly BigQuery and MySQL.
• Demonstrable experience with Shopify’s REST & GraphQL APIs.
• Experience with data pipeline and workflow management tools.
• Strong understanding of ETL techniques and best practices.
• Proficient in one or more programming languages (Python preferred)
• Experience with cloud services (e.g., AWS, Google Cloud Platform) and understanding of cloud-based ETL services.
• Excellent problem-solving skills and attention to detail.
• Strong communication and collaboration abilities to work with team members and stakeholders. Nice to Have:
• Experience with data visualization tools and dashboard development.
• Experience with vector databases
• Knowledge of machine learning and statistical modeling is a plus