我也想說很尷尬, 搞到顯現 發現 原來是個版本的bug
spark 1.6.0 有個BUG
希望更多人看到
Traceback (most recent call last):
File "<stdin>", line 1, in <module>
File "/Users/jzhang/github/spark/python/pyspark/sql/context.py", line 430, in createDataFrame
jdf = self._ssql_ctx.applySchemaToPythonRDD(jrdd.rdd(), schema.json())
File "/Users/jzhang/github/spark/python/pyspark/sql/context.py", line 691, in _ssql_ctx
"build/sbt assembly", e)
Exception: ("You must build Spark with Hive. Export 'SPARK_HIVE=true' and run build/sbt assembly", Py4JJavaError(u'An error occurred while calling None.org.apache.spark.sql.hive.HiveContext.\n', JavaObject id=o34))
雖然是一個很明確的錯誤,但是在網上找了好久都沒解決,最后居然發現是spark1.6.0的一個bug, 更新到1.6.1就沒這個問題了,我也是醉了,唉,還是要記得多多保持軟件的更新啊!!!
原文參考 http://shellbye.com/blog/tech_world/spark-bug-lead-to-error-note/
文章列表