Database Management: Administer, optimize, and maintain ClickHouse, MongoDB, and MariaDB databases.
Database Architecture & Design: Develop efficient schemas, indexing strategies, query optimization techniques, sharding, and replication strategies.
Performance Tuning: Optimize queries, allocate resources, and implement caching mechanisms for maximum performance.
Database Administration: Maintain database security and compliance, and implement monitoring, backup, and recovery strategies.
Data Orchestration & ETL/ELT: Work with Apache Airflow, Luigi, Prefect, or Dagster to streamline data pipelines.
High-Velocity Data Streaming: Collaborate with ETL engineers to ensure efficient ingestion of real-time data into ClickHouse, including schema compatibility and ingestion performance tuning.
ClickHouse Optimization: Design schemas, optimize analytical queries on multi-billion row datasets, and implement data retention and roll-up strategies using background merges and view hierarchies.
DevOps & Automation (Preferred): Work with Docker, Kubernetes, and Terraform to deploy and manage database infrastructure.
Collaboration: Work closely with cross-functional teams, including Data Engineers, Software Developers, and DevOps, to implement robust and scalable database solutions.
Databases: ClickHouse, MongoDB, MariaDB
Database Optimization: Query tuning, indexing, replication, sharding
Programming: SQL, Python, Bash scripting
Data Orchestration: Apache Airflow, Luigi, Prefect, Dagster
Streaming & ETL: ClickHouse Kafka table engine
ClickHouse Internals: Data parts, merges, materialized views, TTLs, partitions
Security & Compliance: Backup, monitoring, access control