使用Jsoup从div中提取文本
问题描述:
使用此代码,应用程序应该提取网站div的文本并将其显示在屏幕上,但这不会发生,并且[并且在Logcat中没有显示任何错误,我是什么做错了?使用Jsoup从div中提取文本
package com.androidbegin.jsouptutorial;
import java.io.IOException;
import java.io.InputStream;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.select.Elements;
import android.os.AsyncTask;
import android.os.Bundle;
import android.app.Activity;
import android.app.ProgressDialog;
import android.view.View;
import android.view.View.OnClickListener;
import android.widget.Button;
import android.widget.TextView;
public class MainActivity extends Activity {
TextView txtdesc;
// URL Address
String url = "http://uat.sophiejuliete.com.br/tendencias/";
ProgressDialog mProgressDialog;
@Override
public void onCreate(Bundle savedInstanceState) {
super.onCreate(savedInstanceState);
setContentView(R.layout.activity_main);
// Locate the Buttons in activity_main.xml
Button titlebutton = (Button) findViewById(R.id.titlebutton);
txtdesc = (TextView) findViewById(R.id.desctxt);
// Capture button click
titlebutton.setOnClickListener(new OnClickListener() {
public void onClick(View arg0) {
// Execute Title AsyncTask
new Title().execute();
}
});
}
private class Title extends AsyncTask<Void, Void, String> {
@Override
protected void onPreExecute() {
super.onPreExecute();
mProgressDialog = new ProgressDialog(MainActivity.this);
mProgressDialog.setTitle("Android Basic JSoup Tutorial");
mProgressDialog.setMessage("Loading...");
mProgressDialog.setIndeterminate(false);
mProgressDialog.show();
}
@Override
protected String doInBackground(Void... params) {
String desc = null;
try {
// Connect to the web site
Document document = Jsoup.connect(url).get();
// Using Elements to get the Meta data
Elements description = document.select("div[class=postWrapper]");
// Locate the content attribute
desc = description.text();
} catch (IOException e) {
e.printStackTrace();
}
return desc;
}
@Override
protected void onPostExecute(String result) {
// Set description into TextView
txtdesc.setText(result);
mProgressDialog.dismiss();
}
}
}
这是网站,你需要分析结构:
<div class="postWrapper" id="post162">
<div class="postTitle">
<h2>
<a href="http://uat.sophiejuliete.com.br/tendencias/agarradinhos-as-orelhas/">
Agarradinhos às orelhas </a>
</h2>
<div class="fb-custom-share" data-url="http://uat.sophiejuliete.com.br/tendencias/agarradinhos-as-orelhas/">
Compartilhar
</div>
<div class="date">
26 de janeiro de 2015 </div>
</div>
<div class="postContent"><p>Agarradinhos às orelhas, os solitários e brincos curtos são ideais tanto para o dia como para a noite.</p>
<p>E melhor ainda ficam bem em qualquer formato de rosto.</p>
<p>Basta apenas escolher o modelo conforme a ocasião que você vai utilizar.</p>
<p> </p>
<p><a href="http://sophiejuliete.com.br/shop/brincos.html"><img style="display: block; margin-left: auto; margin-right: auto;" src="http://uat.sophiejuliete.com.br/media/wysiwyg/Agarradinhos_s_orelhas.jpg" alt=""></a></p></div>
</div>
答
尝试
desc = description.text();
,而不是
desc = description.attr("postContent");
例子:
public static void main(String[] args) throws Exception {
String url = "http://uat.sophiejuliete.com.br/tendencias/";
Document document = Jsoup.connect(url).timeout(10000).get();
// Using Elements to get the Meta data
Elements description = document.select("div[class=postContent]");
// Locate the content attribute
String desc = description.text();
System.out.println(desc);
// prints out "Agarradinhos às orelhas, os solitários e brincos..."
}
UPDATE
由于JSoup部分是固定的,你可能有异步任务的一些问题。尝试使用String
作为结果类型,像这样
private class Title extends AsyncTask<Void, Void, String> {
...
@Override
protected String doInBackground(Void... params) {
String desc = null;
try {
// Connect to the web site
Document document = Jsoup.connect(url).get();
// Using Elements to get the Meta data
Elements description = document.select("div[class=postContent]");
// Locate the content attribute
desc = description.text();
} catch (IOException e) {
e.printStackTrace();
}
return desc;
}
@Override
protected void onPostExecute(String result) {
// Set description into TextView
TextView txtdesc = (TextView) findViewById(R.id.desctxt);
txtdesc.setText(result);
mProgressDialog.dismiss();
}
}
更新2
声明txtdesc
全球范围内,在MainActivity
TextView txtdesc;
在onCreate()
txtdesc = (TextView) findViewById(R.id.desctxt);
初始化0
并删除onPostExecute()
声明,所以只有txtdesc.setText(result);
@Override
protected void onPostExecute(String result) {
// Set description into TextView
txtdesc.setText(result);
mProgressDialog.dismiss();
}
我改变了它 DESC = description.attr( “postContent”);这就是为什么, desc = description.text(); 但它没有奏效! – 2015-02-24 13:51:58
适用于我,也许你有你的代码的其他问题,但JSoup部分是好的。我已经添加了一个小例子,你可以检查它是否适用于你? – 2015-02-24 13:55:28
我是计划的新手,如何以及在哪里适合这段代码片段,您可以将其放在我的发布代码上吗? – 2015-02-24 13:59:41