Sam Madden: September 2007 Archives

In regards to an earlier post about the advantages of compression in column-oriented databases, a commenter asked about maintaining the correspondence between columns in a column-oriented database. Sam Madden discusses how the issue isn't really "getting the data back in the original order" as much as it is ensuring that the system has a way of accessing records in other columns that correspond to a particular record in a sorted column. Continue reading "Follow up on compression post: Columns, indices, and sorting" »

One of the key performance features of column databases that Mike mentioned in his previous post was the aggressive use of data compression. In this post, we'll discuss how column-oriented databases are able to more effectively exploit compression than a typical row-oriented system. There are two key points: 1) Columns of data from the same attribute compress better than rows of tuples in a table; and 2) A well-architected database engine using appropriate compression techniques can operate directly on the compressed data, without decompressing it. Continue reading "Good things come in small packages: The advantage of compression in column databases" »

About this Archive

This page is a archive of recent posts written by Sam Madden in September 2007.

Sam Madden: October 2007 is the next archive.

Find recent content on the main index or look in the archives to find all content.