Tag Archives: operations

Monitoring: it’s not just for production

May 23, 2018 Philip O'Toole

Monitoring — the measurement of your system, the gathering of telemetry, and alerting when it behaves anomalously — is key to running large-scale, modern computer systems. But what many developers today don’t realise is that monitoring can be a key part of your design cycle too.


Stop asking "how much data do you have?"

November 29, 2016 Philip O'Toole

In every field there is a question that, while it sounds interesting, betrays a naiveté and lack of sophistication.

In my field, SaaS and data platforms, it’s “how much data do you have?”


rqlite v3: Globally replicating SQLite

May 3, 2016 Philip O'Toole

rqlite is an open-source distributed relational database, which uses SQLite as its storage engine. rqlite is written in Go and uses Raft to achieve consensus across a set of SQLite databases. It gracefully handles leader election, and can tolerate machine failure.

With the v3 release series, rqlite can now replicate SQLite databases on a global scale, with very little effort. Let’s see it in action using the AWS EC2 cloud.


InfluxDB and the Raft consensus protocol

December 16, 2015 Philip O'Toole

I recently presented at the InfluxDB San Francisco Meetup, on InfluxDB and the Raft consensus protocol. My talk was about the fundamental problems of distributed systems, and how InfluxDB uses Raft to solve these issues.


Designing a search system for log data — part 3

December 7, 2015 Philip O'Toole

This is the last part of a 3-part series “Designing and building a search system for log data”. Be sure to check out part 1 and part 2.

In the last post we examined the design and implementation of Ekanite, a system for indexing log data and making that data available for search in near-real-time. In this final post, let’s see Ekanite in action.


Designing a search system for log data — part 2

December 1, 2015 Philip O'Toole

This is the second part of a 3-part series “Designing and building a search system for log data”. Be sure to check out part 1. Part 3 follows this post.

In the previous post I outlined some of the high-level requirements for a system that indexes log data and makes that data available for search, all in near-real-time. Satisfying these requirements involves making trade-offs, and sometimes there are no easy answers.


Designing a search system for log data — part 1

November 22, 2015 Philip O'Toole

This is the first part of a 3-part series “Designing and building a search system for log data”. Part 2 is here, and part 3 is here.

For the past few years, I’ve been building indexing and search systems, for various types of data, and often at scale. It’s fascinating work — only at scale does O(n) really come alive. Developing embedded systems teaches you how computers really work, but working on search systems and databases teaches you that algorithms really do matter.


Who watches the watchers?

September 22, 2015 Philip O'Toole

I’ve written my first post for the InfluxDB blog. In it I discuss the new statistics and monitoring system built into InfluxDB, starting with the 0.9.4 release. Functionality like this is critical when it comes to running a distributed database like InfluxDB.

You can check it out here.