3 Unstar Star 2 Fork 0

科学大数据开源社区 / 多元数据库查询系统-simbaBSD-2-Clause

Create your Gitee Account
Explore and code with more than 5 million developers,Free private repositories !:)
Sign up
多元数据库查询系统-simba spread retract

Clone or download
liliang authored ....
Cancel
Notice: Creating folder will generate an empty file .keep, because not support in Git
Loading...
README.md

simba

insert, extraction and analysis framework for LDM

#Notice 1: scala version should be compatible for the system and the Spark

  1. spark 1.3.1
  2. scala 2.10.4
  3. hadoop 1.2.1
  4. titan 1.0.0

#Notice 2: assume lib in simba home contains following libs hadoop-client-1.2.1.jar
hadoop-gremlin-3.0.1-incubating.jar
hbase-common-0.98.2-hadoop1.jar
htrace-core-2.04.jar hadoop-core-1.2.1.jar
hbase-client-0.98.2-hadoop1.jar
hbase-protocol-0.98.2-hadoop1.jar or you need to include these libs through modifying the build.sbt

#Notice 3: (for titan)

  1. conf contains "conf/titan-hbase-es-simba.properties" configuration file for TitanDB(hbase+es in default)
  2. test_input contains the docs and links data and can be accessed as val docRDD = sc.objectFileDocument val linkRDD = sc.objectFileDocumentLink

compile####

sbt clean compile

run

sbt run

test

sbt test

#Simple Example: var gDB = TitanSimbaDB(sc, titanConf) val docRDD = sc.objectFileDocument gDB.insert(docRDD) gDB.docs().foreach(s => s.simbaPrint()) gDB.close()

Comments ( 0 )

Sign in for post a comment

1
https://git.oschina.net/opensci/simba.git
git@git.oschina.net:opensci/simba.git
opensci
simba
多元数据库查询系统-simba
master

Search

132457 8cb2edc1 1899542 131848 70c8d3a4 1899542