Database-Engineering

Tablet Sizing Strategies

Tablet Sizing Strategies

Valerie Parham-Thompson
Modern distributed databases split large tables into tablets to enable parallel processing and efficient data distribution. Finding the right tablet size impacts everything from query performance to operational overhead. Let’s explore how to approach tablet sizing systematically to achieve optimal performance.
Audit Logging

Audit Logging

YugabyteDB Friday Tech Talk on fuzzy matching for string searches

Valerie Parham-Thompson
Gearing up for my next YFTT presentation next month. It will be on fuzzy matching, a chance to show out some neat string search features.
Fuzzy matching in YugabyteDB

Fuzzy matching in YugabyteDB

Techniques for matching similar strings in YugabyteDB

Valerie Parham-Thompson
Fuzzy string matching in YugabyteDB can be done with wildcard lookups, phonetic algorithms (Soundex, Metaphone), and trigram similarity. I’ll show a demo of practical examples using artist names, highlighting the performance differences between wildcard searches and phonetic indices. A combination of indexed double metaphone and trigram methods works best for both speed and precision. Also, while YugabyteDB supports PostgreSQL-style extensions, some indexing optimizations behave differently due to its distributed storage layer.
Audit Logging in YugabyteDB | YugabyteDB Friday Tech Talks

Audit Logging in YugabyteDB | YugabyteDB Friday Tech Talks

Yugabyte's session-level and object-level audit logging improves security and compliance

Valerie Parham-Thompson
Audit logging is essential for tracking the “who, what, when, and where” of database access and changes, supporting both security and compliance requirements. It helps organizations know who accessed or modified data, schemas, roles, or grants.
Optimizing YugabyteDB Memory Tuning for YSQL

Optimizing YugabyteDB Memory Tuning for YSQL

Learn how to configure YugabyteDB memory for YSQL workloads by adjusting process ratios and key performance parameters

Valerie Parham-Thompson
Memory configuration in YugabyteDB for YSQL workloads involves partitioning among the tserver, master, and postgres processes, each with default ratios. Adjusting these ratios based on workload characteristics helps avoid out-of-memory events. Monitoring memory usage is crucial, and tuning parameters like max_connections, work_mem, and temp_file_limit can optimize both performance and resource utilization.
Best Practices for Monitoring YugabyteDB

Best Practices for Monitoring YugabyteDB

Operational visibility is key to understanding your database across dimensions of uptime, performance, and capacity planning

Valerie Parham-Thompson
At DSS 2021, I provided a comprehensive orientation to monitoring YugabyteDB, focusing on how to interpret and leverage built-in metrics for operational visibility.
Fireside Chat with Kroger at the Distributed SQL Summit

Fireside Chat with Kroger at the Distributed SQL Summit

Kroger - Examining their Two-Year Distributed SQL Journey & What's Next

Valerie Parham-Thompson
The founder of YugabyteDB and I discussed Kroger’s multi-year journey modernizing its technology infrastructure using distributed SQL to support its large-scale retail operations with the Kroger VP of Customer Technology at DSS 2021.
Automation for Girl Scout Events

Automation for Girl Scout Events

Streamline Girl Scout event planning and management with automation tools for communications, scheduling, and tracking

Valerie Parham-Thompson
Automation tools can streamline the planning, registration, and management of Girl Scout events, saving time for troop leaders. Practical examples might include automating communications, scheduling, or badge tracking. This approach helps modernize event management and reduce administrative overhead, making it ideal for volunteers looking to simplify their workflows.
Migrating Oracle Workloads to Google Cloud – BMS

Migrating Oracle Workloads to Google Cloud – BMS

Learn how to migrate Oracle databases to Google Cloud's Bare Metal Solution while minimizing rewrites and downtime

Valerie Parham-Thompson
Migrating Oracle workloads to Google Cloud’s Bare Metal Solution (BMS) offers benefits like reduced rewrites, familiar hardware, and simplified licensing. Challenges include server sizing, OS changes, and database upgrades. Careful planning and consolidation are key for large databases, and BMS is well-suited for organizations aiming to minimize downtime and risk during migration.
ApacheCon, Upgrading Cassandra Using Automation

ApacheCon, Upgrading Cassandra Using Automation

Upgrades using cstar - A presentation from ApacheCon @Home 2020

Valerie Parham-Thompson
I recently did an upgrade of 200+ nodes of Cassandra across multiple environments sitting behind multiple applications using the cstar tool. I chose the cstar tool because, out of all automation options, it has topology awareness specific to Cassandra. I will share my experience with this upgrade, including observations and surprises, as well as a walk-through of the process using a Cassandra cluster provisioned in Docker.