Distributed data processing: Apache
Spark (performance optimization on large datasets) Data storage & query optimization: AWS... (including use of Pandas, PySpark). Expertise in SQL, Apache
Spark, Airflow, AWS S3, Athena, and Parquet file formats. Proven... -
Voir cette offre d'emploi