我正在尝试使用JSOUP从HTML URL源中提取div标签中的文本,这是:

<div class="some_text">
    <strong>Lorem ipsum</strong> dolor sit amet, consectetur adipiscing elit, 
&nbsp;
sed do eiusmod tempor incididunt ut labore et dolore magna aliqua.

Ut enim ad minim veniam, quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat. </div>

使用此代码:

Document document = Jsoup.connect(url).get();           
Elements description = document.select("div[class=some_text]");
String getText = description.text();

输出是:

Lorem ipsum dolor sit amet,consectetur adipiscing elit,sed do eiusmod tempor incididunt ut labore et dolore magna aliqua . Ut enim ad minim veniam,quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat .

正如您所看到的那样,空格和强标记不会反映在文本的输出上,而是
, &nbsp; and <strong>
. 我想要的是包含空格和大胆强调文本输出 . 怎么做到这一点?

我想要的输出是:

Lorem ipsum dolor sit amet,consectetur adipiscing elit,sed do eiusmod tempor incididunt ut labore et dolore magna aliqua . Ut enim ad minim veniam,quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo consequat .