Job Summary:
ParkourSC is a high-growth emerging software company. We focus every day on building unmatched value for our customers and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and future-focused work. We offer a highly collaborative, caring team environment with a strong focus on learning and development, recognition for individual contributions, and a variety of benefits.
Here at ParkourSC, we are looking for a Data Engineer to join our Advanced Analytics team, which delivers optimizations, predictions, anomaly detection, and simulations to support our supply chain operations solutions. As a Data Engineer, you will be responsible for developing analytical models to support key components of the ParkourSC application.
Technical Skills/Qualifications:
- Strong hands-on experience in data processing, ETL development, and building scalable data pipelines
- Proficiency in SQL, including complex queries, performance tuning, and development of UDFs
- Experience with Apache Spark and PySpark for large-scale data processing
- Working knowledge of Databricks platform and distributed data processing environments
- Experience with batch (bulk loads) and streaming data ingestion frameworks
- Familiarity with data modeling concepts (star/snowflake schemas, normalization/denormalization)
- Experience working with relational databases (e.g., PostgreSQL, MySQL); exposure to NoSQL databases is a plus
- Understanding of data validation, data quality checks, and pipeline testing practices
- Programming skills in Python and ability to write reusable, maintainable code
- Exposure to cloud platforms (AWS, Azure, or GCP) and cloud-based data services is a plus
- Familiarity with version control systems (e.g., Git) and CI/CD practices is preferred
- Strong problem-solving skills and attention to detail
Responsibilities:
- Design, develop, and maintain ETL processes and scalable data pipelines for batch and streaming workloads
- Implement data ingestion processes including bulk data loads and near real-time streaming integrations
- Develop and optimize SQL queries, stored procedures, and UDFs to support analytics and business reporting
- Build and maintain data processing jobs using Apache Spark and PySpark within Databricks or similar environments
- Collaborate with data analysts, data scientists, and cross-functional teams to understand data requirements and deliver reliable datasets
- Perform data cleansing, transformation, and validation to ensure high data quality and integrity
- Monitor and troubleshoot pipeline performance issues, ensuring reliability and scalability
- Maintain clear technical documentation for data workflows, schemas, transformations, and operational processes
- Support deployment, testing, and ongoing maintenance of data solutions in development and production environments
- Continuously improve data engineering practices by adopting best practices in performance optimization, testing, and automation
Experience:
- 2–5 years of experience in data engineering, data integration, or related roles
Culture:
- Detail-oriented, collaborative, and growth-focused environment with opportunities to work on modern data platforms and scalable data architectures
Company Description:
ParkourSC digitizes supply chain operations to improve resilience, increase agility, and drive strategic innovation. Our digital supply chain operations platform is powered by next-generation technologies such as hyper-scale graph modeling, AI/ML, and massive real-time data ingestion from IoT and other systems and signals. Customers use ParkourSC to create intelligent digital twins of their supply chain, continuously align planning and execution, foster collaboration across the extended enterprise, and increase profitability by delivering new technology-enabled products, ensuring quality, compliance, and sustainability, and eliminating millions of dollars of waste.
For more information, visit: www.ParkourSC.com
ParkourSC d.o.o.
ParkourSC was founded on a compelling vision: take a radical new approach to unlock the massive value within supply chains and transform them into more powerful, data-driven strategic assets. In a world of increasing business disruption, unpredictability, and customer expectations, continuous visibility and continuous intelligence within the supply chain are critical to a company’s growth and a necessary strategic advantage. We are shaping the present and future of supply chains by creating…