coalchan / TPC-H

TPC-H

Resources

  1. TPC-H version 2.18.0 from http://www.tpc.org/tpch/ has been uploaded to Baidu Pan (code: cnyx)
  2. TPC-H_on_Hive by Yuntao Jia: https://issues.apache.org/jira/browse/HIVE-600

Steps

  1. cd dbgen && cp makefile.suite makefile

  2. Edit makefile at line 108, setting CC, DATABASE, MACHINE, and WORKLOAD as shown

vi makefile

################
## CHANGE NAME OF ANSI COMPILER HERE
################
CC      = gcc
# Current values for DATABASE are: INFORMIX, DB2, TDAT (Teradata)
#                                  SQLSERVER, SYBASE, ORACLE, VECTORWISE
# Current values for MACHINE are:  ATT, DOS, HP, IBM, ICL, MVS,
#                                  SGI, SUN, U2200, VMS, LINUX, WIN32
# Current values for WORKLOAD are:  TPCH
DATABASE= HIVE
MACHINE = LINUX
WORKLOAD = TPCH

  3. Add a matching Hive section to tpcd.h

vi tpcd.h

#ifdef HIVE
#define GEN_QUERY_PLAN "EXPLAIN"
#define START_TRAN "START TRANSACTION"
#define END_TRAN "COMMIT"
#define SET_OUTPUT ""
#define SET_ROWCOUNT "limit %d;\n"
#define SET_DBASE "use %s;\n"
#endif

  4. Run make

make

  5. Generate the data

./dbgen -s 1

Note: -s sets the data scale factor; 1 produces about 1 GB of data. The 8 .tbl files generated in the current directory are the data we need.
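
As a quick sanity check after dbgen finishes, the sketch below looks for the 8 table files and prints a row count for each. The function name check_tbl_files is hypothetical (it is not part of dbgen), and the file names assume dbgen's default naming for the eight TPC-H tables.

```shell
# Hypothetical helper (not part of dbgen): verify that all 8 TPC-H .tbl files exist.
# Assumes the default dbgen output names, one file per table.
check_tbl_files() {
    dir="${1:-.}"
    missing=0
    for t in customer lineitem nation orders part partsupp region supplier; do
        if [ -s "$dir/$t.tbl" ]; then
            # dbgen writes one row per line, so the line count is the row count
            echo "$t.tbl: $(wc -l < "$dir/$t.tbl" | tr -d ' ') rows"
        else
            echo "$t.tbl: MISSING or empty" >&2
            missing=$((missing + 1))
        fi
    done
    # Exit status is the number of missing files (0 means everything is present)
    return "$missing"
}
```

Run it from the dbgen directory as `check_tbl_files .`; a non-zero exit status means at least one table file was not produced.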

Issues

  1. On macOS, the build fails with fatal error: 'malloc.h' file not found

Change #include <malloc.h> to #include <sys/malloc.h>
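
If several source files include malloc.h, the change can be applied in one pass. This is a minimal sketch, assuming the affected files are the .c and .h files directly inside the dbgen directory; the function name fix_malloc_include is hypothetical.

```shell
# Hypothetical helper: rewrite the malloc.h include in every .c/.h file of a directory.
fix_malloc_include() {
    dir="${1:-.}"
    for f in "$dir"/*.c "$dir"/*.h; do
        [ -f "$f" ] || continue   # skip glob patterns that matched nothing
        if grep -q '#include <malloc.h>' "$f"; then
            # -i.bak is accepted by both BSD (macOS) and GNU sed; it keeps a backup copy
            sed -i.bak 's|#include <malloc.h>|#include <sys/malloc.h>|' "$f"
            echo "patched $f"
        fi
    done
}
```

Call `fix_malloc_include .` inside dbgen before running make. The .bak files keep the originals, so the change is easy to revert.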
