Tag Archives: distributed systems

InfluxDB 0.9.0 released

June 11, 2015 Philip O'Toole Leave a comment

The first version of the 0.9.0 series of InfluxDB has been released. It’s alpha-quality software but all of us on the InfluxDB team are very excited to see the software reach this stage.

You can read more about the release on this blog post.

Reviewing Elasticsearch Cookbook

March 13, 2015 Philip O'Toole Leave a comment

I recently acted as one of the official technical reviewers for ElasticSearch Cookbook – Second Edition by Alberto Paro. Published by Packt Publishing, the book contains a large number of “recipes” for elasticsearch.

Continue reading Reviewing Elasticsearch Cookbook →

Book Review: Cassandra High Availability

February 3, 2015 Philip O'Toole Leave a comment

Packt recently asked me to review their new publication Cassandra High Availability, written by Robbie Strickland.

I’ve worked with Cassandra in the past — early designs of Loggly‘s 2nd generation Log analytics platform used Cassandra as its authoritative store for log data, but we ended up pulling it and using elasticsearch as both the store and search engine.

Continue reading Book Review: Cassandra High Availability →

Measure Everything

November 16, 2014 Philip O'Toole Leave a comment

Tomorrow I join the team at InfluxDB, something I’m really excited about. I’m really looking forward to coding in Go full-time — it’s a language with real promise, a nice clean tool chain, and a very active community.
Continue reading Measure Everything →

Replicating SQLite using Raft Consensus

September 2, 2014 Philip O'Toole 27 Comments

SQLite is a “self-contained, serverless, zero-configuration, transactional SQL database engine”. However, it doesn’t come with replication built in, so if you want to store mission-critical data in it, you better back it up. The usual approach is to continually copy the SQLite file on every change.

I wanted SQLite, I wanted it distributed, and I really wanted a more elegant solution for replication. So rqlite was born.

Continue reading Replicating SQLite using Raft Consensus →

Call me Definitely

July 14, 2014 Philip O'Toole Leave a comment

The creator of the network monitoring system Riemann, Kyle Kingsbury, has put together a comprehensive series of blog posts, on the fault-tolerance, high-availability, and general correctness of number of database and storage technologies. Of the technologies discussed I am most familiar with — elasticsearch and Apache Kafka — I found the posts to be a great read.

If you haven’t read them yet, you should check them out on his site.

InfluxDB and Grafana HOWTO

June 9, 2014 Philip O'Toole 14 Comments

This blog describes working with InfluxDB 0.8. InfluxDB 0.8 is no longer supported, and has been superseded by the 1.0 release.

I recently came across InfluxDB — it’s a time-series database built on LevelDB. It’s designed to support horizontal as well as vertical scaling and, best of all, it’s not written in Java — it’s written in Go. I was intrigued to say the least.

Continue reading InfluxDB and Grafana HOWTO →

Distributed Systems for Fun and Profit

February 6, 2014 Philip O'Toole Leave a comment

I came across a very readable paper on distributed systems — Distributed systems for fun and profit. I recommend it for anyone interested in learning more about distributed systems, and the challenges involved with designing, building, and operating distributed systems.

Infrastructure at Scale: Apache Kafka, Twitter Storm and elasticsearch

December 25, 2013 Philip O'Toole Leave a comment

AWS have posted the video online of Jim Nisbet’s and my talk at AWS:reinvent 2013. In it, Jim and I describe the system we built at Loggly, which uses Apache Kafka, Twitter Storm, and elasticseach, to build a high-performance log aggregation and analytics SaaS solution, running on AWS EC2.

Continue reading Infrastructure at Scale: Apache Kafka, Twitter Storm and elasticsearch →