Projects on Google Code Results 1 - 10 of 19
The goal of this project is twofold: to solve the problem of using the streaming API to load the Wikipedia articles dump into an HBase instance, and to provide code examples for HBase. You are free to modify, adapt, and redistribute this code, and suggestions are very welcome.
==What is HBase-Writer?== HBase-Writer is an extension to the Heritrix open source crawler written by the Internet Archive (http://crawler.archive.org/) that enables it to store crawled content directly into HBase tables (http://hbase.org/) running on the Hadoop Distributed FileSystem (http://hadoo...
= AppScale = AppScale is a platform that allows users to deploy and host their own [http://code.google.com/appengine/ Google App Engine] applications. It executes automatically over [http://aws.amazon.com/ec2 Amazon EC2] and [http://open.eucalyptus.com/ Eucalyptus] as well as [http://www.xen.o...
Neptune is another open source project implementing Google's Bigtable. Neptune has the following features: * Basic data service * Single-row operations (put, get) * Multi-row operations (Scanner) * Data uploader (DirectUploader) * MapReduce (TabletInputFormat) * NQL(Neptune Qu...
The goal of the Hadoop UI project is to provide an intuitive, powerful, and accessible client for the [http://hadoop.apache.org/ Hadoop] MapReduce framework. The following features have been implemented: * HDFS Explorer: file manager for the distributed file system (HDFS) * Job Manager: Monitor a...
A system that can store a very large number of images in HBase and retrieve each one by its image ID with a reasonable response time.
Hadoop/HBase-based project on large climate data sets for CS 4960-001 at the University of Utah, Fall 2009.
Implementation of the [http://code.google.com/appengine Google App Engine] [http://code.google.com/appengine/docs/python/datastore/ Datastore] in [http://java.sun.com/javase/ Java 6] using (initially) [http://hadoop.apache.org/hbase hbase] and [http://hadoop.apache.org/core/ hadoop] as the [http://l...
A distributed, high-performance, scalable database system for simple structured data.
Companies such as Google, Yahoo and Amazon have realised infrastructure capable of rapidly indexing and mining huge volumes of data in a manner tolerant of hardware failure and accommodating continued growth. Until recently, the underlying technologies were a closely guarded secret, but emergin...