我的名为'numers.csv'的csv文件的编号排列为
3,5,6
1,7,8
2,2,4
我的python文件有以下代码
#!/usr/bin/env python2
import apache_beam as beam
import csv
p = beam.Pipeline(options=PipelineOptions())
pipeline=beam.Pipeline('DirectRunner')
(
pipeline
|beam.io.ReadFromText('numbers.csv')
|beam.Map(lambda line:csv.reader([line]))
|beam.io.WriteFromText('output.csv')
)
result=pipeline.run()
result.wait_until_finish()
它应该获取数字并将其写入output.csv文件,但我收到错误
AttributeError:'module'对象没有属性'Pipeline'
当我改变我的python代码导入apache-beam我得到错误
将apache-beam导入为beam
SyntaxError:语法无效
所以把它更改为'apache_beam'再次强调,当我运行python代码时,我收到错误“AttributeError:'module'对象没有属性'Pipeline'”
附: 1)我已经安装了apache_beam包
要求已经满足:.tov / lib / python2.7 / site-packages中的setuptools(来自protobuf <= 3.3.0,> = 3.2.0-> apache_beam)
2)我也做了pip install six == 1.10.0根据堆栈溢出的一些问题要求已经满足:6 == 1.10.0 in ./env/lib/python2.7/site-packages
3)最后,pip install google-cloud-dataflow它说要求已经满足了 .
请帮忙