Experience with web crawlers, large-scale data processing workflows is a plus. This role is responsible for all aspects of data collection to support our model……
You'll work closely with senior data engineers to process, clean, and transform large datasets, ensuring data integrity and availability for analytics and……
We're looking for coding specialists who live and breathe algorithms, data structures, software architecture, frontend and backend development, cloud……
Collaborate with data scientists and ML engineers to enable scalable AI solutions. 3+ years of hands-on experience in data engineering, with a focus on big data……
Build and manage scalable data lakes, data warehouses, and real-time data pipelines. Ensure data quality, lineage, governance, and compliance across data……
Individuals under 18 (eighteen) years of age; The successful candidate will be responsible for ensuring the quality and reliability of our App through a……
$ 2.900.000 - $ 4.300.000 (Proporcionado por el empleador)
Candidatura rápida
Creating and improving data processing workflows. Exposure to AI, LLM, or data-related projects. Collaborating with engineers, technical leads, and clients to……
Additionally, we are responsible for collecting and analyzing data to optimize product quality. This specific team is responsible for the functional……
Lead and grow a team of software engineers contributing to Backblaze cloud services, spanning both customer-facing features and underlying infrastructure.…
Estamos desarrollando un sistema inteligente de control que combine contexto operativo, análisis situacional y visión en tiempo real para tomar decisiones……
Implement, manage, and optimize data processing and machine learning inference pipelines on our servers. We are looking for a talented software engineer to help……
The ideal candidate will have a strong background in data engineering, cloud-based architectures, and proficiency in implementing data pipelines to transform……
We're looking for Python coding specialists who live and breathe algorithms, data structures, software architecture, frontend and backend development, cloud……
Build and optimize data storage solutions — data lakes and data warehouses — for efficient retrieval and processing. Deliver solutions on time and within scope.…
Proactively identifies hidden problems and patterns in data and uses these insights to drive improvements to coding hygiene and system architecture.…
Students with at least 18 months remaining until graduation (graduating in December 2027 or later). Build with Large Language Models (LLMs): Your daily work……
Integrarte a un equipo de datos para diseñar, desarrollar y operar pipelines ETL/ELT en Azure, garantizando la ingesta, transformación y disponibilidad de datos……
Experiencia de al menos 5 años en roles comerciales del segmento empresas (excluyente). Atención y asesoramiento: Brindar atención personalizada a compañías de……
At least 5 years of hands-on software development experience, with proven ability to own complex projects and consistently deliver within deadlines.…
Collaboration and Communication: Strong interpersonal skills to collaborate effectively with stakeholders, data engineers, data scientists, and other cross-……
Experience with Kafka, CDC, and enterprise data integration platforms. This role blends deep database engineering expertise with SRE principles to ensure……
$ 3.595 - $ 4.314 (Proporcionado por el empleador)
Candidatura rápida
Review heatmaps, session recordings, and user behavior data. You will work directly with our Digital Marketing Manager, Engineering Team, and Operations Team to……
Experience with data quality frameworks and observability tools for data pipelines. Collaborate with data scientists and security researchers to understand data……
Bluelight is a leading software consultancy dedicated to designing and developing innovative technology that enhances users' lives. With a steadfast commitment to delivering exceptional service to our clients, Bluelight excels in its focus on quality and customer satisfaction. Our mission is not only to create cutting-edge applications but also to foster a collaborative and enriching work environment where each team member can grow and thrive. With a presence across the United States and Central/South America, Bluelight is in an exciting phase of expansion, continually seeking exceptional talent to join its dynamic and diverse community.
As an ETL Data Engineer, you will play a critical role in our client’s expanding data engineering team, designing, developing, and maintaining data integration processes primarily using Python (PySpark) and Azure Synapse Analytics to ensure the accuracy and availability of analytical data. Working closely with data scientists, analysts, and other stakeholders to deliver high-quality data for insights and decision-making, this position is ideal for a passionate software development professional who thrives in a fast-paced, dynamic environment where everyone's opinions and efforts are valued. By joining our client’s growing software consultancy, you will have the opportunity to contribute to challenging, market-standing projects within a collaborative community that deeply values hard work, continuous learning, personal growth, and professional development.
Responsibilities
ETL Data Engineering: Develop and maintain ETL data engineering processes using Python (PySpark) within Azure Synapse Analytics Notebooks, and/or Azure Synapse Analytics Pipelines, to ensure efficient data extractions, transformation, and loading.
Data Warehousing: Apply your expertise in data warehousing, understanding star schemas, facts, and dimensions, to design and build effective data storage structures in a Massively Parallel Processing (MPP) SWL Pool.
Data Source Expertise: Extract data from various sources, including REST APIs, SWL database tables, and CSV files.
Azure Synapse Analytics Expertise: Utilize your deep knowledge of Azure Synapse Analytics to design and optimize data notebooks/pipelines for scalability and performance.
Data Fabric Concepts: Contribute to the implementation and understanding of other Data Fabric concepts, such as data lakes, lakehouses, delta lakes, and data cataloging, to enhance data management capabilities.
Data Modeling: Collaborate with data architects to create data models and schemas that align with business requirements.
Data Quality: Implement data quality checks and validation processes to maintain data accuracy and consistency.
Performance Tuning: Identify and resolve performance bottlenecks and optimize ETL data notebooks/pipelines to meet SLAs.
Monitoring and Troubleshooting: Monitoring ETL jobs, diagnose issues, and implement solutions to ensure data pipeline reliability.
Documentation: Maintain comprehensive documentation of ETL data engineering processes, data flows, and data transformations.
Collaboration: Work closely with cross-functional teams to understand data requirements and provide support for data-related initiatives.
Security and Compliance: Ensure data security and compliance with data governance and privacy standards.
Qualifications
Bachelor’s degree in Computer Science, Information Technology, or a related field; or equivalent work experience, with certifications related to data engineering or data science (e.g. Azure Data Engineer) being a plus.
Proven experience in ETL data engineering with significant expertise in using Python (PySpark) to perform data extraction, transformation, and loading from REST APIs, SQL database tables, and CSV files.
Proficiency in using Azure Synapse Analytics resources including Notebooks, Pipelines, Linked Services, and Azure Key Vault.
Demonstrated ability to write complex SQL queries, optimize query performance, and work with both SparkSQL and MS SQL to effectively extract, transform, and load data.
Knowledge of data integration best practices and tools.
Experience with version control systems, such as Git (Azure DevOps).
Strong problem-solving and analytical skills, with a keen attention to detail.
Excellent communication skills, both verbal and written, with the ability to work collaboratively in a team environment with shifting priorities.
Familiarity with big data technologies, machine learning, and data analysis preferred.
Experience with data visualization tools (e.g. Power BI, Tableau) and Agile Methodologies a plus.
Being a consultant in our team is a fun, challenging, and rewarding career choice. Your contributions are highly valued by clients, and the work you do often has a direct and significant impact on their business.
You will have the opportunity to work on a variety of projects for our incredible clients, which will accelerate your career growth. You’ll collaborate with modern technologies and work alongside some of the best professionals in the industry!
If you’re eager to be part of an exciting, challenging, and rapidly growing consultancy, we encourage you to apply. #LI-Remote