| Projects on Google Code | Results 1 - 10 of 19 |
The goal of this project is twofold: to solve the problem of using the streaming API to load the Wikipedia articles dump in a HBase instance and to provide code examples for HBase. You are free to modify, adapt, and redistribute this code and suggestions are very welcomed.
==What is HBase-Writer?==
HBase-Writer is an extension to the Heritrix open source crawler written by the Internet Archive (http://crawler.archive.org/) that enables it to store crawled content directly into HBase tables (http://hbase.org/) running on the Hadoop Distributed FileSystem (http://hadoo...
= !AppScale =
!AppScale is a platform that allows users to deploy and host their own [http://code.google.com/appengine/ Google App Engine] applications. It executes automatically over [http://aws.amazon.com/ec2 Amazon EC2] and [http://open.eucalyptus.com/ Eucalyptus] as well as [http://www.xen.o...
AppScale,
Hypertable,
HBase,
HDFS,
Hadoop,
AppEngine,
CloudComputing,
MySQL,
Cassandra,
Voldemort,
MongoDB,
MemcacheDB
Neptune is another open source project implementing Google's Bigtable.
Neptune has the following features.
* Basic data service
* Single row operation(put, get)
* Multi row operation(Scanner)
* Data uploader(DirectUploader)
* MapReduce(TabletInputFormat)
* NQL(Neptune Qu...
The goal of the Hadoop UI project is to provide an intuitive, powerful and accessible client for the [http://hadoop.apache.org/ Hadoop] map reduce framework.
The following features have been implemented:
* HDFS Explorer: file manager for distributed file system (HDFS)
* Job Manager: Monitor a...
A system that can store very large number of images in Hbase and retrieve each one using the image id with reasonable response time
Hadoop/Hbase based climate large data set project for CS 4960-001 at the University of Utah, Fall 2009.
Implementation of the [http://code.google.com/appengine Google App Engine] [http://code.google.com/appengine/docs/python/datastore/ Datastore] in [http://java.sun.com/javase/ Java 6] using (initially) [http://hadoop.apache.org/hbase hbase] and [http://hadoop.apache.org/core/ hadoop] as the [http://l...
a distributed, high performance and scalability simple struct database system!
Companies such as Google, Yahoo and Amazon have realised infrastructure capable of rapidly indexing and mining huge volumes of data in a manner tolerant of hardware failure and accommodating continued growth. Until recently, the underlying technologies were a closely guarded secret, but emergin...