Java EscapeMode Class Code Examples


This article collects typical usage examples of the org.jsoup.nodes.Entities.EscapeMode class in Java. If you are wondering how EscapeMode is used in practice, the curated examples below may help.

EscapeMode is an enum nested in the org.jsoup.nodes.Entities class. Ten code examples are shown below, sorted by popularity by default.
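Most of the examples below select EscapeMode.xhtml. According to the jsoup documentation, that mode restricts named entities to lt, gt, amp and quot. The following is a minimal, stdlib-only sketch of that idea, not jsoup's implementation: the class EscapeModeSketch and the method escapeMinimal are hypothetical names introduced here for illustration. How jsoup itself treats other characters depends on the jsoup version and the output charset.

```java
final class EscapeModeSketch {

    // Hypothetical helper, not part of jsoup: escapes only the four
    // characters that the restricted entity set covers.
    static String escapeMinimal(String text) {
        StringBuilder out = new StringBuilder(text.length());
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            switch (c) {
                case '&': out.append("&amp;"); break;
                case '<': out.append("&lt;"); break;
                case '>': out.append("&gt;"); break;
                case '"': out.append("&quot;"); break;
                default: out.append(c); // everything else is copied unchanged
            }
        }
        return out.toString();
    }

    public static void main(String[] args) {
        // \u00e9 is the letter e with an acute accent; it is copied unchanged.
        System.out.println(escapeMinimal("Caf\u00e9 <b> & \"tea\""));
    }
}
```

In this sketch the accented letter passes through untouched, while the markup-significant characters are replaced by their entities.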

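Example 1: cleanContent

import java.util.Arrays;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Entities.EscapeMode; // import the required package/class
import org.jsoup.safety.Cleaner;
import org.jsoup.safety.Whitelist;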
/**
 * Cleans the HTML content, keeping only the following tags: b, em, i, strong, u, br, cite, p, img, li, ul, ol, sup, sub, s
 * @param content HTML content
 * @param extraTags any other tags that you may want to keep, e.g. "a"
 * @return the cleaned HTML
 */
public String cleanContent(String content, String ... extraTags) {
	Whitelist allowedTags = Whitelist.simpleText(); // This whitelist allows only simple text formatting: b, em, i, strong, u. All other HTML (tags and attributes) will be removed.
	allowedTags.addTags("br", "cite", "em", "i", "p", "strong", "img", "li", "ul", "ol", "sup", "sub", "s");
	allowedTags.addTags(extraTags);
	allowedTags.addAttributes("p", "style"); // Needed for left and right alignment
	allowedTags.addAttributes("img", "src", "style", "class"); 
	if (Arrays.asList(extraTags).contains("a")) {
		allowedTags.addAttributes("a", "href", "target"); 
	}
	Document dirty = Jsoup.parseBodyFragment(content, "");
	Cleaner cleaner = new Cleaner(allowedTags);
	Document clean = cleaner.clean(dirty);
	clean.outputSettings().escapeMode(EscapeMode.xhtml); // Does not escape UTF-8 characters
	String safe = clean.body().html();
	return safe;
}

Developer: xtianus, Project: yadaframework, Lines of code: 23, Source: YadaWebUtil.java

Example 2: extractPackSection

import org.jsoup.Jsoup;
import org.jsoup.nodes.Element;
import org.jsoup.nodes.Entities.EscapeMode; // import the required package/class
public String extractPackSection() {	
	mDoc = Jsoup.parse(mHtmlStr);
	mDoc.outputSettings().escapeMode(EscapeMode.xhtml);	
	
	String pack_section = "";
	// Find all information between X and X+1
	Element start_elem = mDoc.select("p:contains(Packungen)").first();			
	Element stop_elem = mDoc.select("p:contains(Zulassungsinhaberin)").first();	
	// Alternative:
	/*
	Element start_elem = mDoc.select("p[id=section18]").first();			
	Element stop_elem = mDoc.select("p[id=section19]").first();	
	*/
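	// Check for missing elements before dereferencing start_elem
	if (start_elem!=null && stop_elem!=null) {
		Element pe = start_elem.nextElementSibling();
		// Stop at stop_elem, or when the siblings run out
		while (pe!=null && pe!=stop_elem) {
			System.out.println(pe.text());
			pe = pe.nextElementSibling();
		}
	}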
	return pack_section;
}

Developer: zdavatz, Project: aips2xml, Lines of code: 23, Source: HtmlUtils.java

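Example 3: htmlTextToPlainText

import java.nio.charset.StandardCharsets;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Entities.EscapeMode; // import the required package/class
import org.jsoup.parser.Parser;
import org.jsoup.safety.Cleaner;
import org.jsoup.safety.Whitelist;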
/**
 * Cleans some HTML text by stripping all tags but <code>br</code> and then
 * unescapes named entities like '&quot;'. brs will be replaced by
 * newlines.
 *
 * @param htmlText the HTML text to convert
 * @return the plain text
 */
String htmlTextToPlainText(final String htmlText) {
    final Whitelist whitelist = Whitelist.none();
    whitelist.addTags("br");
    final Cleaner cleaner = new Cleaner(whitelist);
    final Document cleanedDocument = cleaner.clean(Jsoup.parse(htmlText));
    cleanedDocument
            .outputSettings()
            .prettyPrint(false)
            .escapeMode(EscapeMode.xhtml)
            .charset(StandardCharsets.UTF_8);
    return Parser.unescapeEntities(cleanedDocument.body().html().trim(), true).replaceAll("<br(?: ?/)?>", "\r\n");
}

Developer: EuregJUG-Maas-Rhine, Project: site, Lines of code: 21, Source: RegistrationService.java
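The last step of the htmlTextToPlainText example above replaces br tags with line breaks using a regular expression. The sketch below isolates that step with the same pattern so it can be tried without jsoup; the class BrToNewlineSketch and the method brToNewline are hypothetical names used only for illustration.

```java
final class BrToNewlineSketch {

    // Same pattern as in the example above: it matches <br>, <br/> and <br />.
    static String brToNewline(String html) {
        return html.replaceAll("<br(?: ?/)?>", "\r\n");
    }

    public static void main(String[] args) {
        // Each of the three br spellings is turned into a CRLF line break.
        System.out.print(brToNewline("one<br>two<br/>three<br />four"));
    }
}
```

The optional group `(?: ?/)?` accepts the self-closing forms with or without a space before the slash, which covers the spellings a serializer is likely to emit.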

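Example 4: stripXSS

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Entities.EscapeMode; // import the required package/class
import org.jsoup.safety.Whitelist;
import org.owasp.esapi.ESAPI;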
/**
 * Strips any potential XSS threats out of the value
 *
 * @param value
 * @return
 */
public String stripXSS(String value) {
	if (StringUtils.isBlank(value)) {
		return null;
	}
	// try {
	// value = ESAPI.encoder().encodeForHTML(value);
	// } catch (Exception e) {
	// logger.warn(e.getMessage(),e); //
	// }

	// Use the ESAPI library to avoid encoded attacks.
	value = ESAPI.encoder().canonicalize(value);
	//
	// // Avoid null characters
	value = value.replaceAll("\0", "");
	value = value.replaceAll("<", "&lt;").replaceAll(">", "&gt;");
	value = value.replaceAll("\\(", "&#40;").replaceAll("\\)", "&#41;");
	value = value.replaceAll("'", "&#39;");
	value = value.replaceAll("eval\\((.*)\\)", "");
	value = value.replaceAll("[\\\"\\\'][\\s]*javascript:(.*)[\\\"\\\']", "\"\"");
	value = value.replaceAll("script", "");
	//
	// // Clean out HTML
	Document.OutputSettings outputSettings = new Document.OutputSettings();
	outputSettings.escapeMode(EscapeMode.xhtml);
	outputSettings.prettyPrint(false);
	value = Jsoup.clean(value, "", Whitelist.none(), outputSettings);
	return value;
}

Developer: xuegongzi, Project: rabbitframework, Lines of code: 36, Source: XSSFilter.java
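One step of the stripXSS example above deletes the literal word "script" from the input. The sketch below isolates only that step to show two properties of single-pass deletion: removing a match can assemble a new match, and harmless words are altered too. The class BlacklistSketch and the method stripScriptWord are hypothetical names for illustration; in the full method the later Jsoup.clean call with Whitelist.none() still processes the result, so this is an observation about the one step, not about the method as a whole.

```java
final class BlacklistSketch {

    // Same call as in the example above, isolated for experimentation.
    static String stripScriptWord(String value) {
        return value.replaceAll("script", "");
    }

    public static void main(String[] args) {
        // Removing the inner match leaves the outer fragments joined together.
        System.out.println(stripScriptWord("scrscriptipt"));
        // Ordinary words that contain the substring are changed as well.
        System.out.println(stripScriptWord("description"));
    }
}
```

This is one reason such filters usually rely on a parser-based cleaner, as the example does in its final step, rather than on substring deletion alone.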

Example 5: parseContent

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Entities.EscapeMode; // import the required package/class
Document parseContent(final String content) {
    Document document = Jsoup.parse(content);
    document.outputSettings().escapeMode(EscapeMode.xhtml);
    document.outputSettings().syntax(Document.OutputSettings.Syntax.xml);

    // remove script tags, they are not supported in pdf and can lead to
    // not well formed document (e.g. <\/script> - escaped script tag)
    document.select("script").remove();

    return document;
}

Developer: aleksandr-m, Project: struts2-pdfstream, Lines of code: 12, Source: PdfStreamResult.java

Example 6: parse

import org.jsoup.Jsoup;
import org.jsoup.nodes.Entities.EscapeMode; // import the required package/class
public static IDocument parse(String s) {
	org.jsoup.nodes.Document doc = Jsoup.parse(s);
	
	if (NOT_USE_HTML_ENCODE) {
		doc.outputSettings().escapeMode(EscapeMode.xhtml);
	}

	return new Document(doc);
}

Developer: Taulukko, Project: taulukko-commons-parsers, Lines of code: 10, Source: HTMLParser.java

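Example 7: clean

import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Entities.EscapeMode; // import the required package/class
import org.jsoup.safety.Cleaner;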
/**
 * Clean HTML string and return the cleaner version.
 * 
 * @param html Input HTML string.
 * @return Cleaned version of the HTML as string.
 */
public String clean(String html)
{
	// Parse the string into a Document
	Document doc = Jsoup.parse(html);
	// Clean the document
	doc = new Cleaner(wl).clean(doc);
	// Adjust escape mode
	doc.outputSettings().escapeMode(EscapeMode.xhtml);

	// Get back the string of the Document
	return doc.html();
}

Developer: emir-munoz, Project: uraptor, Lines of code: 19, Source: HtmlCleaner.java

Example 8: ScheduleDocument

import org.jsoup.nodes.Document;
import org.jsoup.nodes.DocumentType;
import org.jsoup.nodes.Entities.EscapeMode; // import the required package/class
public ScheduleDocument(){
	doc = Document.createShell("");
	doc.outputSettings().escapeMode(EscapeMode.xhtml);
	DocumentType type = new DocumentType("html", "", "", "");
	doc.prependChild(type);
	doc.select("html").attr("class", "js no-touch geolocation backgroundsize csstransforms csstransforms3d audio localstorage inlinesvg pointerevents webaudio mediaqueries getusermedia");
	
}

Developer: arscyper, Project: adan, Lines of code: 9, Source: ScheduleDocument.java

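Example 9: addHeaderToXml

import java.util.Date;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.nodes.Entities.EscapeMode; // import the required package/class
import org.jsoup.select.Elements;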
static String addHeaderToXml(String xml_str) {	
	Document mDoc = Jsoup.parse("<kompendium>\n" + xml_str + "</kompendium>");
	mDoc.outputSettings().escapeMode(EscapeMode.xhtml);
	mDoc.outputSettings().prettyPrint(true);
	mDoc.outputSettings().indentAmount(4);
	
	// Add date
	Date df = new Date();
	String date_str = df.toString();
	mDoc.select("kompendium").first().prependElement("date");
	mDoc.select("date").first().text(date_str);
	// Add language
	mDoc.select("date").after("<lang></lang>");
	if (DB_LANGUAGE.equals("de"))
		mDoc.select("lang").first().text("DE");
	else if (DB_LANGUAGE.equals("fr"))
		mDoc.select("lang").first().text("FR");

	// Fool jsoup.parse which seems to have its own "life" 
	mDoc.select("tbody").unwrap();
	Elements img_elems = mDoc.select("img");
	for (Element img_e : img_elems) {
		if (!img_e.hasAttr("src"))
			img_e.unwrap();
	}
	mDoc.select("img").tagName("image");
	
	String final_xml_str = mDoc.select("kompendium").first().outerHtml();		
	
	return final_xml_str;
}

Developer: zdavatz, Project: aips2xml, Lines of code: 32, Source: Aips2Xml.java

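Example 10: extractHtmlSection

import java.util.ArrayList;
import java.util.List;
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Entities.EscapeMode; // import the required package/class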
static String[] extractHtmlSection(MedicalInformations.MedicalInformation m) {	
	// Extract section titles and section ids
	MedicalInformations.MedicalInformation.Sections med_sections = m.getSections();
	List<MedicalInformations.MedicalInformation.Sections.Section> med_section_list = med_sections.getSection();

	Document doc = Jsoup.parse(m.getContent());
	doc.outputSettings().escapeMode(EscapeMode.xhtml);
	
	// Clean html code
	HtmlUtils html_utils = new HtmlUtils(m.getContent());
	html_utils.clean();					
	
	// Extract registration number (swissmedic no5)
	String regnr_str = "";
	if (DB_LANGUAGE.equals("de"))
		regnr_str = html_utils.extractRegNrDE(m.getTitle());
	else if (DB_LANGUAGE.equals("fr"))
		regnr_str = html_utils.extractRegNrFR(m.getTitle());
	
	// Sanitize html
	String html_sanitized = "";								
	// First check for bad boys (version=1! but actually version>1!)
	if (!m.getVersion().equals("1") || m.getContent().substring(0, 20).contains("xml")) {
		for (int i=1; i<22; ++i) {
			html_sanitized += html_utils.sanitizeSection(i, m.getTitle(), DB_LANGUAGE);
		}
		html_sanitized = "<div id=\"monographie\">" + html_sanitized + "</div>" ;
	} else {
		html_sanitized = m.getContent();
	}
	
	// Update "Packungen" section and extract therapeutisches index
	List<String> mTyIndex_list = new ArrayList<String>();						
	String mContent_str = updateSectionPackungen(m.getTitle(), package_info, regnr_str, html_sanitized, mTyIndex_list);
	
	// Add meta-tag and link
	mContent_str = mContent_str.replaceAll("<head>", "<head>" +
			"<link href=\"amiko_stylesheet.css\" rel=\"stylesheet\" type=\"text/css\"></>" +
			"<meta http-equiv=\"Content-Type\" content=\"text/html; charset=utf-8\">");

	m.setContent(mContent_str);		
	
	// Fix problem with wrong div class in original Swissmedic file
	if (DB_LANGUAGE.equals("de")) {
		m.setStyle(m.getStyle().replaceAll("untertitel", "untertitle"));
		m.setStyle(m.getStyle().replaceAll("untertitel1", "untertitle1"));
	}
	
	// Correct formatting error introduced by Swissmedic
	m.setAuthHolder(m.getAuthHolder().replaceAll("&#038;","&"));
	
	// Extracts only *first* registration number
	/*
	List<String> swissmedicno5_list = Arrays.asList(regnr_str.split("\\s*,\\s*"));		
	String[] swno5_content_map = {swissmedicno5_list.get(0), mContent_str};
	*/
	// Extract *all* registration numbers
	String[] swno5_content_map = {regnr_str, mContent_str};
	
	return swno5_content_map; //mContent_str;
}

Developer: zdavatz, Project: aips2xml, Lines of code: 62, Source: Aips2Xml.java
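The extractHtmlSection example above applies two replaceAll calls to the style string, first for "untertitel" and then for "untertitel1". The sketch below reproduces that order on a hypothetical input to show that the first call already rewrites the longer name, so the second call has nothing left to match. The class ReplaceOrderSketch, its method names, and the sample CSS are assumptions made for illustration.

```java
final class ReplaceOrderSketch {

    // Same two calls, in the same order, as in the example above.
    static String renameBoth(String style) {
        String renamed = style.replaceAll("untertitel", "untertitle");
        return renamed.replaceAll("untertitel1", "untertitle1");
    }

    // Only the first call, for comparison.
    static String renameFirstOnly(String style) {
        return style.replaceAll("untertitel", "untertitle");
    }

    public static void main(String[] args) {
        String css = ".untertitel { } .untertitel1 { }"; // hypothetical input
        System.out.println(renameBoth(css));
        // Both variants produce the same string for this input.
        System.out.println(renameBoth(css).equals(renameFirstOnly(css)));
    }
}
```

Because "untertitel" is a prefix of "untertitel1", the result is the same either way; the second call is harmless but redundant.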

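Note: The org.jsoup.nodes.Entities.EscapeMode examples in this article were collected from GitHub, MSDocs, and other source-code and documentation hosting platforms. The snippets come from open-source projects, and copyright remains with the original authors. For distribution and use, refer to each project's license. Do not reproduce without permission.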
