By Rich Cargile on February 5th, 2013 // No Comments
Hadoop is a new database technology that allows organizations to store massive amounts of unstructured data. It also allows for the creation of distributed applications that run in parallel across nodes against this unstructured data. Hadoop is an open source framework licensed by Apache, and was pioneered by large “Data” companies such as Facebook, Google, and Yahoo.
The Hadoop platform consists of two key technologies:
Hadoop Distributed File System (HDFS)
HDFS is a distributed file...
Continue Reading
By Rich Cargile on September 18th, 2012 // No Comments
xVelocity is in-memory columnar storage technology that powers the analytical engine of PowerPivot. Introduced as VertiPaq in the SQL Server 2008 R2 release, it was re-released for SQL Server 2012 in PowerPivot and SQL Server Analysis Services.
The xVelocity columnstore index feature is used to enable high performance query processing in SQL Server data warehouses and data marts. Some reports claim that the ColumnStore Index feature can produce performance boosts of 10-100x for some queries.
...
Continue Reading
By Rich Cargile on June 12th, 2012 // No Comments
Organizations rarely start out with a complicated data infrastructure. Small organizations often start out with just a very small-scale system, perhaps only a few key applications.
These systems must exchange data, but since only a few systems are involved, the data exchange processes between these applications are easier to implement and manage.
Figure 1: Phase 1 of the Corporate Information Factory
However, as the organization grows and matures, new systems are intro...
Continue Reading
By Rich Cargile on July 21st, 2010 // No Comments
When processing cubes, you have a couple of different choices about how to process your cube. One method is use an SSIS package with CubeDimension Processing tasks.
Another method to process your cube is to use a stored procedure.
Below is a simple SQL Server stored procedure that you can call to process an OLAP cube using T-SQL.
Create Procedure ProcessCube
@Database varchar(100),
@Cube varchar(100),
@Partition varchar(100) = null, -- If NULL, process the entire Cube
@Serve...
Continue Reading
By Rich Cargile on October 22nd, 2009 // No Comments
The Change Data Capture feature is a new feature in SQL Server 2008 that allows processes toquickly and easily identify changes to a table in SQL Server 2008. It primarily works by watching the SQL server transaction logs, and running processes that pull changes from the logs and moving them into tables.Change Data Capture SetupEnabling Change Data Capture is a two step process. First, you must enable it at the database level, then you must enable it for each table for which changes must be reco...
Continue Reading