数据导入Apache Solr到Datastax Solr - 日期格式转换器DIH

问题描述:

我想用DIH将数据从Apache Solr导入到Datastax Solr。我能够获取文档,但是当二氢尝试创建文件,我得到的日期字段,提示以下错误:数据导入Apache Solr到Datastax Solr - 日期格式转换器DIH

org.apache.solr.common.SolrException: Invalid Date String:'Thu Jun 08 16:23:00 PDT 2017' 
at org.apache.solr.schema.DateField.parseMath(DateField.java:182) 
at com.datastax.bdp.search.solr.core.types.V1TypeMapper.formatToCassandraType(V1TypeMapper.java:166) 
at com.datastax.bdp.search.solr.core.types.V2TypeMapper.formatToCassandraType(V2TypeMapper.java:101) 
at com.datastax.bdp.search.solr.Cql3CassandraRowWriter.write(Cql3CassandraRowWriter.java:170) 
at com.datastax.bdp.search.solr.handler.update.CassandraDirectUpdateHandler.addDoc(CassandraDirectUpdateHandler.java:161) 
at org.apache.solr.update.processor.RunUpdateProcessor.processAdd(RunUpdateProcessorFactory.java:69) 
at org.apache.solr.update.processor.UpdateRequestProcessor.processAdd(UpdateRequestProcessor.java:51) 
at org.apache.solr.update.processor.DistributedUpdateProcessor.versionAdd(DistributedUpdateProcessor.java:595) 
at org.apache.solr.update.processor.DistributedUpdateProcessor.processAdd(DistributedUpdateProcessor.java:435) 
at org.apache.solr.update.processor.UpdateRequestProcessor.processAdd(UpdateRequestProcessor.java:51) 
at org.apache.solr.update.processor.AbstractDefaultValueUpdateProcessorFactory$DefaultValueUpdateProcessor.processAdd(AbstractDefaultValueUpdateProcessorFactory.java:94) 
at org.apache.solr.handler.dataimport.SolrWriter.upload(SolrWriter.java:70) 
at org.apache.solr.handler.dataimport.DataImportHandler$1.upload(DataImportHandler.java:235) 
at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:504) 
at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:408) 
at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:323) 
at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:231) 
at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:411) 
at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:476) 
at org.apache.solr.handler.dataimport.DataImporter$1.run(DataImporter.java:457) 

我用我DIH的日期格式转换,但似乎没有效果就可以了。在创建要在Datastax Solr中编制索引的文档之前,DateFormat转换器会将日期文件从Apache Solr转换为'dateTimeFormat'中指定的格式。

<dataConfig> 
    <document> 
    <entity name="sep_byOrderNumber" processor="SolrEntityProcessor" query="OrderNumber:${dataimporter.request.OrderNumberList}" rows="${dataimporter.request.batchSize}" url="${dataimporter.request.urlSource}" transformer="DateFormatTransformer" loglevel="debug"> 
     <field column="OrderCreateDate" dateTimeFormat="yyyy-MM-dd'T'HH:mm:ss.SSSZ" /> 
    </entity> 
    </document> 
</dataconfig> 

有人可以帮忙找出问题吗?

您dateTimeFormat更改为:EEE MMM dd HH:mm:ss z yyyy

Letter Date or Time Component    Presentation Examples 
y  Year        Year   1996; 96 
M  Month in year (context sensitive) Month  July; Jul; 07 
d  Day in month      Number  10 
E  Day name in week     Text   Tuesday; Tue 
H  Hour in day (0-23)     Number  0 
m  Minute in hour      Number  30 
s  Second in minute     Number  55 
z  Time zone       Time zone Pacific Standard Time; PST; GMT-08:00 

来源:http://docs.oracle.com/javase/8/docs/api/java/text/SimpleDateFormat.html

+0

谢谢你,但我仍然面临着同样的错误。我的改性DIH-config.xml的是如下: '' – Preethi

+0

您是否重新加载核心或重新启动solr并尝试重新索引数据? –

+0

我部署了新的dih配置,重新加载了内核并启动了dih导入。它提取文件,但由于上述错误未能索引,处理或跳过。 – Preethi