
SBT application with Spark and Cassandra - Symbol 'type <none>.package.DataFrame' is missing from the classpath


I am trying to create a simple Apache Spark application that connects to Cassandra through the DataStax Cassandra connector and performs some operations, but I get this error:

Symbol 'type <none>.package.DataFrame' is missing from the classpath.

My build.sbt:

name := "spark-app"
version := "1.0"
scalaVersion := "2.11.11"


libraryDependencies ++= Seq(
  "com.datastax.spark" %% "spark-cassandra-connector" % "2.0.0",
  "org.apache.spark" %% "spark-core" % "2.1.1" % "provided"
)

resolvers += "Spark Packages Repo" at "https://dl.bintray.com/spark-packages/maven"

My simple application:

package com.budgetbakers.be.dwh.spark
import com.datastax.spark.connector._
import org.apache.spark.{SparkConf, SparkContext}

object Distinct {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf(true)
      .set("spark.cassandra.connection.host", "127.0.0.1")

    val sc = new SparkContext(conf)
    println(sc.cassandraTable("ks", "users").select("gender").distinct().collect().mkString(","))
    sc.stop()
  }
}

When I try to package the project, I get the following compilation error:

[error] /.../Distinct.scala:18: Symbol 'type <none>.package.DataFrame' is missing from the classpath.
[error] This symbol is required by 'value com.datastax.spark.connector.package.dataFrame'.
[error] Make sure that type DataFrame is in your classpath and check for conflicting dependencies with `-Ylog-classpath`.
[error] A full rebuild may help if 'package.class' was compiled against an incompatible version of <none>.package.
[error]     println(sc.cassandraTable("ks", "users").select("gender").distinct().collect().mkString(","))
[error]             ^

Am I missing something? Maybe there is some dependency conflict?

Versions I am using:

  • cassandra:3.1

  • apache spark:2.1.1

  • spark cassandra connector:2.0.0

  • scala:2.11

  • sbt:0.13.15

  • sbt-assembly plugin: 0.14.0

1 Answer

  • 4

    Try adding the spark-sql dependency alongside the core library. For future reference, there are example build files here.
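
    A likely reason this helps: in Spark 2.x, DataFrame is a type alias declared in the org.apache.spark.sql package object, which ships in the spark-sql artifact rather than in spark-core. The connector's package object refers to that type (the 'value com.datastax.spark.connector.package.dataFrame' named in the error), so the compiler needs spark-sql on the classpath even when the application never touches DataFrames itself. A minimal sketch of the changed dependency block, with the rest of build.sbt left as in the question. The sparkVersion value and the "provided" scope on spark-sql are assumptions made to mirror the existing spark-core line, not something the answer specifies:

```scala
// Hypothetical helper value: keeps spark-core and spark-sql on the same Spark release.
val sparkVersion = "2.1.1"

libraryDependencies ++= Seq(
  "com.datastax.spark" %% "spark-cassandra-connector" % "2.0.0",
  "org.apache.spark"   %% "spark-core" % sparkVersion % "provided",
  // Assumption: "provided" scope, as for spark-core. This artifact contains
  // org.apache.spark.sql, where the DataFrame type alias is declared.
  "org.apache.spark"   %% "spark-sql"  % sparkVersion % "provided"
)
```

    After reloading the build, running package again should show whether the missing-symbol error is gone. If it persists, the -Ylog-classpath flag suggested in the compiler message can help check for conflicting Spark versions.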

相关问题