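Understanding the DOM parser for Java

Question:

Below are the XML I want to parse with DOM, my code, and the output. I need to get the information out of the SimpleData elements, but I have not managed to.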

XML:

<kml> 
    <Document> 
    <Folder id="kml_ft_Meter_Rates_and_Time_Limits"> 
     <name>Meter_Rates_and_Time_Limits</name> 
     <Placemark id="kml_1"> 
     <name>$1.00/hr 2hr time limit</name> 
     <snippet> </snippet> 
     <description><![CDATA[<center><table><tr><th colspan='2' align='center'><em>Attributes</em></th></tr><tr bgcolor="#E3E3F3"> 
      <th>RATE</th> 
      <td>$1.00</td> 
      </tr><tr bgcolor=""> 
      <th>LIMIT</th> 
      <td>2hr</td> 
      </tr></table></center>]]> 
     </description> 
     <styleUrl>#ParkingMeterStyler_KMLStyler</styleUrl> 
     <ExtendedData> 
      <SchemaData schemaUrl="#Meter_Rates_and_Time_Limits"> 
      <SimpleData name="RATE">$1.00</SimpleData> 
      <SimpleData name="LIMIT">2hr</SimpleData> 
      </SchemaData> 
     </ExtendedData> 
     <LineString> 
      <coordinates>-123.100739208611,49.2630169018194,0 -123.100348847572,49.2630078055425,0 </coordinates> 
     </LineString> 
     </Placemark> 
    </Folder> 
    </Document> 
</kml> 

Code, full of sysouts for debugging purposes:

 System.out.println("Root element :" + doc.getDocumentElement().getNodeName()); 
     System.out.println("Root 1st child :" + doc.getDocumentElement().getChildNodes().item(1).getNodeName()); 
     System.out.println("Document 1st child :" + doc.getDocumentElement().getChildNodes().item(1).getChildNodes().item(1).getNodeName()); 
     System.out.println("Document 2nd child :" + doc.getDocumentElement().getChildNodes().item(1).getChildNodes().item(2).getNodeName()); 
     System.out.println("Document 3rd child :" + doc.getDocumentElement().getChildNodes().item(1).getChildNodes().item(3).getNodeName()); 
     System.out.println("Document 4th child :" + doc.getDocumentElement().getChildNodes().item(1).getChildNodes().item(4).getNodeName()); 
     System.out.println("Document 5th child :" + doc.getDocumentElement().getChildNodes().item(1).getChildNodes().item(5).getNodeName()); 
     System.out.println("-----------------------"); 


     NodeList nList = doc.getElementsByTagName("Placemark"); 
     nList = nList.item(1).getChildNodes(); 
     System.out.println("Placemark list, 1st placemark 1st child :" + nList.item(1).getNodeName()); 
     System.out.println("Placemark list, 1st placemark 2nd child :" + nList.item(2).getNodeName()); 
     System.out.println("Placemark list, 1st placemark 3rd child :" + nList.item(3).getNodeName()); 
     System.out.println("Placemark list, 1st placemark 4th child :" + nList.item(4).getNodeName()); 
     System.out.println("-----------------------"); 
     System.out.println("Placemark list, 1st placemark 9th child :" + nList.item(9).getNodeName()); 
     System.out.println("-----------------------"); 
     nList = nList.item(9).getChildNodes(); 
     System.out.println("Extended data, 1st child :" + nList.item(1).getNodeName()); 
     System.out.println("-----------------------"); 
     System.out.println("Schema data, 1st child :" + nList.item(1).getChildNodes().item(1).getNodeName()); 
     System.out.println("Simple data :" + nList.item(1).getChildNodes().item(4).getNodeName()); 
     System.out.println("-----------------------"); 
     System.out.println("Schema data, 2nd child :" + nList.item(1).getChildNodes().item(3).getNodeName()); 
     System.out.println("Simple data :" + nList.item(1).getChildNodes().item(4).getNodeName()); 

Console output:

Root element :kml 
Root 1st child :Document 
Document 1st child :name 
Document 2nd child :#text 
Document 3rd child :visibility 
Document 4th child :#text 
Document 5th child :Style 
----------------------- 
Placemark list, 1st placemark 1st child :name 
Placemark list, 1st placemark 2nd child :#text 
Placemark list, 1st placemark 3rd child :snippet 
Placemark list, 1st placemark 4th child :#text 
----------------------- 
Placemark list, 1st placemark 9th child :ExtendedData 
----------------------- 
Extended data, 1st child :SchemaData 
----------------------- 
Schema data, 1st child :SimpleData 
Simple data :#text 
----------------------- 
Schema data, 2nd child :SimpleData 
Simple data :#text 

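Your "name" node is a child of the "Folder" node, so how can you treat it as a child of the "Document" node? The same problem applies to the other nodes, and that is why you see #text everywhere. – Arham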


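There are 2 nodes called name: one is the first child of Document and the other is the first child of Placemark. I did not want to post the full XML because it is very long. Actually, make that 3 nodes called name: 1) the 1st child of Document, 2) the 1st child of Folder, and 3) the 1st child of Placemark.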


Item indices start at 0, not 1. – shyam

nList.item(0).getChildNodes().item(9).getChildNodes().item(1).getChildNodes().item(1).getTextContent() -> prints $1.00

nList.item(0).getChildNodes().item(9).getChildNodes().item(1).getChildNodes().item(3).getTextContent() -> prints 2hr

Here nList is used right after the line NodeList nList = doc.getElementsByTagName("Placemark");. Please fix your traversal accordingly.
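The index-based chain above depends on exactly where the whitespace #text nodes fall. As a less fragile variant, here is a minimal sketch that looks the SimpleData elements up by tag name inside the first Placemark and reads them with getTextContent(). The class name SimpleDataReader, the method readSimpleData, and the trimmed-down SAMPLE_KML string are assumptions made for illustration; they are not from the question.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

public class SimpleDataReader {
    // Hypothetical, shortened stand-in for the KML in the question (same element names).
    static final String SAMPLE_KML =
        "<kml><Document><Folder><Placemark id=\"kml_1\">"
      + "<name>$1.00/hr 2hr time limit</name>"
      + "<ExtendedData><SchemaData schemaUrl=\"#Meter_Rates_and_Time_Limits\">"
      + "<SimpleData name=\"RATE\">$1.00</SimpleData>"
      + "<SimpleData name=\"LIMIT\">2hr</SimpleData>"
      + "</SchemaData></ExtendedData></Placemark></Folder></Document></kml>";

    // Returns name -> text for every SimpleData element under the first Placemark.
    static Map<String, String> readSimpleData(String xml) {
        Map<String, String> values = new LinkedHashMap<>();
        try {
            DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
            Document doc = builder.parse(
                new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)));

            NodeList placemarks = doc.getElementsByTagName("Placemark");
            if (placemarks.getLength() == 0) {
                return values; // nothing to read
            }
            // Index 0 is the first Placemark; the cast gives access to Element methods.
            Element firstPlacemark = (Element) placemarks.item(0);

            // Searches all descendants, so whitespace #text siblings never get in the way.
            NodeList simpleData = firstPlacemark.getElementsByTagName("SimpleData");
            for (int i = 0; i < simpleData.getLength(); i++) {
                Element e = (Element) simpleData.item(i);
                values.put(e.getAttribute("name"), e.getTextContent().trim());
            }
        } catch (Exception e) {
            throw new IllegalStateException("could not parse XML", e);
        }
        return values;
    }

    public static void main(String[] args) {
        Map<String, String> values = readSimpleData(SAMPLE_KML);
        System.out.println(values.get("RATE") + " " + values.get("LIMIT")); // prints $1.00 2hr
    }
}
```

Keying the result by the name attribute means the caller does not need to know the order of the SimpleData elements.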


Thanks again Yogendra ;), getTextContent() was my main problem.


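Good to know. Thanks!

I am not sure exactly what you want. Maybe add a bit more detail.

org.w3c.dom.Node has a method getTextContent(). In general, with these w3c classes you cast the result of item(i), for example to Element.

To skip the whitespace text (node name #text), or just to access specific elements more directly, use XPath.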
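To illustrate the cast, here is a small sketch that filters a node's children down to Element instances, so the whitespace #text nodes between tags are skipped before casting. The class ChildElements, its helper methods, and the SAMPLE string are hypothetical names introduced only for this example.

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.Node;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

public class ChildElements {
    // Hypothetical fragment shaped like the SchemaData block in the question,
    // including the line breaks and indentation that become #text nodes.
    static final String SAMPLE =
        "<SchemaData>\n"
      + "  <SimpleData name=\"RATE\">$1.00</SimpleData>\n"
      + "  <SimpleData name=\"LIMIT\">2hr</SimpleData>\n"
      + "</SchemaData>";

    // Returns only the direct children that are elements, in document order.
    static List<Element> childElements(Node parent) {
        List<Element> result = new ArrayList<>();
        NodeList children = parent.getChildNodes();
        for (int i = 0; i < children.getLength(); i++) {
            Node child = children.item(i);
            if (child.getNodeType() == Node.ELEMENT_NODE) {
                result.add((Element) child); // safe cast: node type was checked
            }
        }
        return result;
    }

    static Document parse(String xml) {
        try {
            return DocumentBuilderFactory.newInstance().newDocumentBuilder()
                    .parse(new InputSource(new StringReader(xml)));
        } catch (Exception e) {
            throw new IllegalStateException("could not parse XML", e);
        }
    }

    public static void main(String[] args) {
        Element schemaData = parse(SAMPLE).getDocumentElement();
        // The raw child list also contains the whitespace-only #text nodes.
        System.out.println("raw children: " + schemaData.getChildNodes().getLength());
        for (Element e : childElements(schemaData)) {
            System.out.println(e.getAttribute("name") + " = " + e.getTextContent());
        }
    }
}
```

With a helper like this, the elements sit at indices 0, 1, 2, ... instead of the odd positions seen in the question's output.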
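A rough sketch of the XPath route, using the standard javax.xml.xpath API. The class SimpleDataXPath, the method simpleData, and the shortened SAMPLE_KML are assumptions for illustration, and the expression assumes the document has no default namespace, as in the XML posted in the question.

```java
import java.io.StringReader;
import javax.xml.xpath.XPath;
import javax.xml.xpath.XPathFactory;
import org.xml.sax.InputSource;

public class SimpleDataXPath {
    // Hypothetical, shortened stand-in for the KML in the question.
    static final String SAMPLE_KML =
        "<kml><Document><Folder><Placemark id=\"kml_1\">\n"
      + "  <name>$1.00/hr 2hr time limit</name>\n"
      + "  <ExtendedData><SchemaData schemaUrl=\"#Meter_Rates_and_Time_Limits\">\n"
      + "    <SimpleData name=\"RATE\">$1.00</SimpleData>\n"
      + "    <SimpleData name=\"LIMIT\">2hr</SimpleData>\n"
      + "  </SchemaData></ExtendedData>\n"
      + "</Placemark></Folder></Document></kml>";

    // Returns the text of the SimpleData element with the given name attribute
    // under the first Placemark, or an empty string if there is no match.
    // The name is concatenated into the expression, so pass trusted values only.
    static String simpleData(String xml, String name) {
        try {
            XPath xpath = XPathFactory.newInstance().newXPath();
            String expression = "(//Placemark)[1]//SimpleData[@name='" + name + "']";
            return xpath.evaluate(expression, new InputSource(new StringReader(xml)));
        } catch (Exception e) {
            throw new IllegalStateException("XPath evaluation failed", e);
        }
    }

    public static void main(String[] args) {
        String rate = simpleData(SAMPLE_KML, "RATE");
        String limit = simpleData(SAMPLE_KML, "LIMIT");
        System.out.println(rate + " " + limit); // prints $1.00 2hr
    }
}
```

If the real file declares a default namespace on the kml element, unprefixed names in an XPath expression would likely stop matching, and a NamespaceContext with a prefix would be needed.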


I need to get the strings $1.00 and 2hr out, in the form: $1.00 2hr