Creating Scalable Systems

- September 12, 2011

Some food for thought. Consider the following:

Prefer BASE over ACID transactions
Prefer asynchronous over synchronous transactions
Keeping state is expensive
Considering database sharding (highscalability, codefutures, Pros and Cons) by data, by transaction or by customer but avoid premature optimisation
Design the system for automated rollback
Create isolative structures; share nothing; such that nothing crosses the swimlanes
Design systems for failure
Create idempotent services where possible

Database sharding requires changes in mindset:

Tables may need to be denormalised to optimise sharding (as well as to workaround cross-shard joins/ queries)
Scale-out instead of scale-up
Do away with replication where possible

Different sharding schemes are:

Vertical partitioning – sometimes known as functional or feature partitioning where data relating to certain entities are grouped together. Different functions or features are put onto different shards.
Range-based partitioning – data for a certain function/ feature/ entity is sharded using ranges (such range may be based on year, location, etc.)
Hash-based partitioning – data for a certain function/ feature/ entity is sharded using a hash function (modulo operation)

Database sharding presents a number of issues:

Data needs to be rebalanced from time-to-time
Joining data from multiple shards (cross-shard join) is expensive
Referential integrity is now an issue since referential data may now be in a different database
Sharding is relatively new; no body of knowledge and lack of support

Comments

Anonymous said…

Hi

Great post. Some remarks on Sharding - it is possible to do cross-shard joins, so no need for schema changes. Just use an off-the-shelf tool for that - like ScaleBase (disclosure - I work there). They give you transparent database sharding.

16/09/2011, 05:59

Search This Blog

SOFTware is HARD

Creating Scalable Systems

Comments

Popular posts from this blog

Understanding ITIL Service Management the UML way…

Modelling the Life Insurance New Business Process

How to depict (Professional-Looking) Logical Network Diagrams in Astah