Using Solr 6.2.1

I have a local MySQL database that I want Solr to index for querying.

I created a core named create-test, and after running ...dataimport?command=full-import I get the following status:

<response>
<lst name="responseHeader">
    <int name="status">0</int>
    <int name="QTime">0</int>
</lst>
<lst name="initArgs">
    <lst name="defaults">
        <str name="config">osm-dih.xml</str>
    </lst>
</lst>
<str name="command">status</str>
<str name="status">idle</str>
<str name="importResponse" />
<lst name="statusMessages">
    <str name="Total Requests made to DataSource">1</str>
    <str name="Total Rows Fetched">230750</str>
    <str name="Total Documents Processed">0</str>
    <str name="Total Documents Skipped">0</str>
    <str name="Full Dump Started">2016-11-16 20:08:42</str>
    <str name="">Indexing completed. Added/Updated: 0 documents. Deleted 0 documents.</str>
    <str name="Committed">2016-11-16 20:08:44</str>
    <str name="Time taken">0:0:1.448</str>
</lst>
</response>

What puzzles me is that rows were fetched, but no documents were added.

Here is my data config in osm-dih.xml:

<dataConfig>
  <dataSource name="mysql"
              driver="com.mysql.jdbc.Driver"
              url="jdbc:mysql://localhost:3306/osm"
              user="osm"
              password="Start123"/>
  <document>
    <entity name="way" dataSource="mysql" query="select way_id, way_tags from osm_way">
      <field column="way_id" name="osm_id"/>
      <field column="way_tags" name="way_tags"/>
    </entity>
  </document>
</dataConfig>

A very basic entity.

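Since the number of rows fetched in the status matches the number of rows in the database, I assume the database connection and the query are working.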

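From my searching, I see that some people have additional attributes in their schema.xml file, but I don't see anything like that in the DIH Solr examples or in the official Solr cwiki for DIH. I'm guessing there may be differences between Solr versions.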
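
As an illustration of what such schema entries might look like for the two fields mapped in the entity above, here is a minimal sketch. The field types, the attributes, and the choice of osm_id as the uniqueKey are all assumptions made for the example; none of this is taken from my actual schema, and it is not confirmed to be the cause of the problem.

```xml
<!-- Hypothetical schema entries: names come from the DIH field mapping,
     types and attributes are assumptions for illustration only -->
<field name="osm_id" type="string" indexed="true" stored="true" required="true"/>
<field name="way_tags" type="string" indexed="true" stored="true"/>

<!-- Assumption: osm_id serves as the unique key; if the schema's uniqueKey
     were a different field (for example id), the import would need to
     populate that field instead -->
<uniqueKey>osm_id</uniqueKey>
```

The idea behind the sketch is simply that the name attributes in the DIH field elements should line up with fields the schema actually declares, including whichever field is the uniqueKey.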

Does anyone know why rows are fetched but no documents are added?


Edit 1: I ran the import in verbose debug mode; here is the beginning of the raw debug response:

{
  "responseHeader": {
    "status": 0,
    "QTime": 1495
  },
  "initArgs": [
    "defaults",
    [
      "config",
      "osm-dih.xml"
    ]
  ],
  "command": "full-import",
  "mode": "debug",
  "documents": [],
  "verbose-output": [
    "entity:way",
    [
      "document#1",
      [
        "query",
        "select way_id, way_tags from osm_way",
        "time-taken",
        "0:0:1.382",
        null,
        "----------- row #1-------------",
        "way_tags",
        "{\"name\": \"Mount Royal\", \"lanes\": \"2\", \"highway\": \"tertiary\"}",
        "way_id",
        2627409,
        null,
        "---------------------------------------------"
      ],
      "document#1",
      [
        null,
        "----------- row #1-------------",
        "way_tags",
        "{\"name\": \"Longfellow\", \"lanes\": \"2\", \"highway\": \"residential\", \"surface\": \"asphalt\"}",
        "way_id",
        2627414,
        null,
        "---------------------------------------------"
      ],
...

It looks like every row is processed as document#1, but the documents array is empty.