SARAH

Data Engineer and Analyst - AI Enthusiast
28 数据/人工智能/机器学习/数据科学家住在 上海国籍 摩洛哥
分享

简介

Passionate Data Engineer and Analyst with a Master's in Software Engineering from Shanghai Jiao Tong University and solid experience in ETL pipelines, data warehousing, and real-time analytics. Skilled in Python, SQL, PostgreSQL, Dagster, Airflow, and data visualization. Previously contributed to aviation and product data projects, with a growing focus on AI and machine learning. Eager to apply data-driven solutions to real-world problems and contribute to innovative teams in onsite, hybrid, or remote environments.

工作经历

Data Engineer

Shanghai Feihao Property Management Co., Ltd.
2023.09-至今(2 年)
Led the overall design of databases to store, organize, and retrieve information with automated ETL pipelines using Python, RDBMS, SQL, Apache Airflow and Kafka, besides demonstrating strong communication skills in client presentations and negotiations, contributing to business growth and part- nerships.

Consultant Data Engineer

Admiral
2022.11-2023.05(7 个月)
November 2022 - Mai 2023 Built data pipelines that integrated information from diferent sources, including cleaning, transforming, and loading data into a central data warehouse. Automated and scheduled these ETL processes for consistent data aggregation. (Python, SQL, Hadoop, Tableau, Spark, Kafka)

Machine Learning Intern

Freebeat
2020.07-2021.01(7 个月)
Participated in building, training, testing, and deploying a complex recurrent neural network for Music Audio Analysis and Pattern Recognition using Python, with Tensorflow and Keras frameworks.

Artificial Intelligence Researcher - Academic

Shanghai Jiao Tong University - SEIEE
2019.09-2020.06(10 个月)
Worked with a team of 1 Ph.D. and 2 Master’s students on a computer vision task to build a classifi- cation model serving the medical field using Convolutional Neural Networks (CNNs), under the supervision of Dr. Yao Jianguo. The model achieved high accuracy in diagnosing arrhythmias from ECG time series.

项目

ETL for aviation data

Data Engineer
2024.10-2025.03(6 个月)
Developed an ETL pipeline for data from different sources like APIs, DBs, and flat files, involving data cleaning, normalization, and standardization processes, loaded the data into a centralized data warehouse, and automated and scheduled the overall pipeline. Python, Pandas, SQLAlchemy, PostgreSQL, SQL, Dagster, Github actions, CI/CD Pipeline Orchestration.

Full-stack web application

developer
2023.10-2024.01(4 个月)
Developed a full-stack web application for the presentation of the company and user registration functionality as a training task, handling the front end, back end, database management, and containerization. React, Node.js, Express, Postgres, JavaScript, Docker.

Machine learning and deep learning on audio data

Assistant AI engineer
2020.08-2020.10(3 个月)
Optimized a deep learning model for music beat and downbeat detection using an Artificial Neural Networks, combined with a multi-resolution spectrogram processor. Python, SciPy/NumPy. The developed deep learning model outperformed existing models by 12% in accuracy, was successfully implemented in production, and contributed to attracting more investors to the company’s product.

AI on images

Developer
2018.12-2019.02(3 个月)
Developed a model to visualize image data and its semantic relationships, for tasks like object detection and image annotation. Python, Keras.

E-commerce mobile application

Developer
2017.05-2017.06(2 个月)
Developed, with a team of 4, a mobile Android application for e-commerce, enabling user account registration, login functionality, product input with detailed specifications, and exploration of product catalogs. Java, SQLite, RESTful API, Android Studio.

教育经历

Shanghai Jiao Tong University

Software Engineering.
2019.09-2022.09(3 年)
Master of Software Engineering. Overall GPA: 3.5/4.00 Relevant courses: Algorithm Analysis and Theory, Web search and mining, Data visualization, Computer networks, Internet of things. Scientific research direction: Deep Learning, Artificial Intelligence, Machine Learning.

Hassan II University of Casablanca

Mathematics & Computer Science.
2014.09-2017.06(3 年)
Bachelor of Science in Mathematics & Computer Science. Relevant courses: Data Structures & Algorithms, Database Management Systems, Oriented Object Programming, Web Development, Networks programming, Systems programming, Functional programming, Mobile App Development.

语言

英语
精通
法语
母语
阿拉伯语
母语
中文(普通话)
一般

证书

Introduction to Relational Databases
2023.08
Convolutional Neural Networks certificate
2021.05
Sequences, Time Series and Prediction
2020.05
Neural Networks and Deep Learning certificate
2020.02
Machine Learning
2020.01
IELTS
2019.03
Chinese language proficiency HSK3
2018.06

技能

International Trading as a part time
Content Creation
Driving License in China
搜索简历
国籍
职位类别
城市或国家
职位
人才
博客
我的