28 Dec 2008 20:50
[jira] Commented: (PIG-569) Inconsistency with Hadoop in Pig load statements involving globs with subdirectories
[
https://issues.apache.org/jira/browse/PIG-569?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12659462#action_12659462
]
Tom White commented on PIG-569:
-------------------------------
Hadoop has supported this only since 0.19 (see HADOOP-3498, PIG-252). Are you using Hadoop 0.19? You could
be getting this error since Pig is using the libraries from Hadoop 0.17 or 0.18.
> Inconsistency with Hadoop in Pig load statements involving globs with subdirectories
> ------------------------------------------------------------------------------------
>
> Key: PIG-569
> URL: https://issues.apache.org/jira/browse/PIG-569
> Project: Pig
> Issue Type: Bug
> Components: impl
> Affects Versions: types_branch
> Environment: FC Linux x86/64, Pig revision 724576
> Reporter: Kevin Weil
> Fix For: types_branch
>
>
> Pig cannot handle LOAD statements with Hadoop globs where the globs have subdirectories. For example,
> A = LOAD 'dir/{dir1/subdir1,dir2/subdir2,dir3/subdir3}' USING ...
> A similar statement in Hadoop, hadoop dfs -ls dir/{dir1/subdir1,dir2/subdir2,dir3/subdir3}, does
work correctly.
> The output of running the above load statement in pig, built from svn revision 724576, is:
(Continue reading)
RSS Feed