Modern distributed databases split large tables into tablets to enable parallel processing and efficient data distribution. Finding the right tablet size impacts everything from query performance to operational overhead. Let’s explore how to approach tablet sizing systematically to achieve optimal performance.
Fuzzy string matching in YugabyteDB can be done with wildcard lookups, phonetic algorithms (Soundex, Metaphone), and trigram similarity. I’ll show a demo of practical examples using artist names, highlighting the performance differences between wildcard searches and phonetic indices. A combination of indexed double metaphone and trigram methods works best for both speed and precision. Also, while YugabyteDB supports PostgreSQL-style extensions, some indexing optimizations behave differently due to its distributed storage layer.
Audit logging is essential for tracking the “who, what, when, and where” of database access and changes, supporting both security and compliance requirements. It helps organizations know who accessed or modified data, schemas, roles, or grants.
Memory configuration in YugabyteDB for YSQL workloads involves partitioning among the tserver, master, and postgres processes, each with default ratios. Adjusting these ratios based on workload characteristics helps avoid out-of-memory events. Monitoring memory usage is crucial, and tuning parameters like max_connections, work_mem, and temp_file_limit can optimize both performance and resource utilization.
At DSS 2021, I provided a comprehensive orientation to monitoring YugabyteDB, focusing on how to interpret and leverage built-in metrics for operational visibility.
The founder of YugabyteDB and I discussed Kroger’s multi-year journey modernizing its technology infrastructure using distributed SQL to support its large-scale retail operations with the Kroger VP of Customer Technology at DSS 2021.
Automation tools can streamline the planning, registration, and management of Girl Scout events, saving time for troop leaders. Practical examples might include automating communications, scheduling, or badge tracking. This approach helps modernize event management and reduce administrative overhead, making it ideal for volunteers looking to simplify their workflows.
Migrating Oracle workloads to Google Cloud’s Bare Metal Solution (BMS) offers benefits like reduced rewrites, familiar hardware, and simplified licensing. Challenges include server sizing, OS changes, and database upgrades. Careful planning and consolidation are key for large databases, and BMS is well-suited for organizations aiming to minimize downtime and risk during migration.
I recently did an upgrade of 200+ nodes of Cassandra across multiple environments sitting behind multiple applications using the cstar tool. I chose the cstar tool because, out of all automation options, it has topology awareness specific to Cassandra. I will share my experience with this upgrade, including observations and surprises, as well as a walk-through of the process using a Cassandra cluster provisioned in Docker.