Hive导入csv文件示例

正文

现有文件为csv格式,需要导入hive中,设csv内容如下

1 2	1001,zs,23 1002,lis,24

首先创建表

create table if not exists csv2(

uid int ,

uname string,

age int

)

row format serde 'org.apache.hadoop.hive.serde2.OpenCSVSerde'

stored as textfile ;

导入数据及查询

1 2	load data local inpath '/data/csv2.csv' into table csv2; select * from csv2;

其他注意事项

如果建表是parquet格式可否load导入csv文件?

								
									 drop   table   csv2; 

									 create   table   if   not   exists csv2( 

									       uid   int  , 

									       uname string, 

									       age   int 

									 ) 

									 row format serde   'org.apache.hadoop.hive.serde2.OpenCSVSerde' 

									 stored   as   parquet ; 

									 load   data   local   inpath   '/data/csv2.csv'   into   table   csv2; 

									 select   *   from   csv2;

使用时会报错

Failed with exception java.io.IOException:java.lang.RuntimeException: hdfs://192.168.10.101:8020/user/hive/warehouse/csv2/csv2.csv is not a Parquet file. expected magic number at tail [80, 65, 82, 49] but found [44, 50, 52, 10]

**不可以,需要先导入成textfile,之后再从临时表导入成parquet,**如下

								
									 drop   table   csv2; 

									 create   table   if   not   exists csv2 

									 ( 

									       uid     int  , 

									       uname string, 

									       age     int 

									 ) 

									       row format serde   'org.apache.hadoop.hive.serde2.OpenCSVSerde' 

									       stored   as   textfile; 

									 -- 先导入csv文件到表格csv2,保存格式是textfile 

									 load   data   local   inpath   '/data/csv2.csv'   into   table   csv2; 

									 drop   table   csv3; 

									 -- 创建csv3,保存格式parquet 

									 create   table   if   not   exists csv3 

									 ( 

									       uid     int  , 

									       uname string, 

									       age     int 

									 ) 

									       row format delimited 

									           fields terminated   by   ',' 

									       stored   as   parquet; 

									 -- 提取csv2的数据插入到csv3 

									 insert   overwrite   table   csv3   select   *   from   csv2;

总结

关键是要引入org.apache.hadoop.hive.serde2.OpenCSVSerde csv 要保存到 hive 的 parquet ,需要先保存成 textfile

以上就是Hive导入csv文件示例的详细内容，更多关于Hive导入csv文件的资料请关注服务器之家其它相关文章！

原文链接：https://blog.csdn.net/u010711495/article/details/111722382

查看更多关于Hive导入csv文件示例的详细内容...

声明：本文来自网络，不代表【好得很程序员自学网】立场，转载请注明出处：http://www.haodehen.cn/did231367

更新时间：2023-05-19 阅读：29次