Working Student - Software Engineer
- Migrated client systems from InfluxDB v1 to v3, streamlining data handling and reducing the computational overhead of data type conversions. Added pytest test cases that run in a containerized environment.
- Developed a prototype compaction routine to address the small-file problem for Parquet data in S3 buckets and optimize data storage. Used S3 Tables with PyIceberg and custom AWS Glue jobs with SQS, RDS, and Lambda (Docker runtime), managing the infrastructure as code with Terraform. Evaluated the implemented tools and runtimes on cost, maintenance overhead, and query performance.
- Redesigned the database schema to support multiple data storage granularities, avoiding InfluxDB's database count limits for large-scale battery device data. Enhanced Spring Boot APIs and improved unit and integration test coverage.