当前位置：移动技术网 > IT编程>开发语言>Java > 谈谈为JAXB和response设置编码，解决wechat4j中文乱码的问题

谈谈为JAXB和response设置编码，解决wechat4j中文乱码的问题

2019年07月22日 | 移动技术网IT编程 | 我要评论

郄英才是谁的秘书,mhyoes,江阴车震门

如果有哪一个做程序员的小伙伴说自己没有遇到中文乱码问题，我是不愿意相信的。今天在做微信订阅号的智能回复时，又一时迷乱的跳进了中文乱码这个火坑。刚解决问题时，都欢呼雀跃了，完全忘记了她曾经带给我的痛苦。

一、问题描述

这里写图片描述

看到没，红色框框内的乱码赤裸裸的对我进行挑衅，而我却无可奈何，真是糟糕透顶。

二、寻求解决之道

面对问题，只有拿着刀逼自己去解决啊，能怎么样呢？

首先，必须搞清楚微信智能回复的机制，画图如下：

ps，工具用得不好，请见谅。

接下来，我们抓重点，看乱码重要发生在什么位置。

1.controller返回给用户

response.setheader("content-type", "text/html;charset=utf-8");// 浏览器编码
response.getoutputstream().write(result.getbytes());

就这段代码了，指定response的编码方式为utf-8，按理说乱码问题应该出现好转，但是结果依然是没有。

2.jaxb的toxml

public string toxml(object obj) {
  string result = null;
  try {
    jaxbcontext context = jaxbcontext.newinstance(obj.getclass());
    marshaller m = context.createmarshaller();

    m.setproperty(marshaller.jaxb_encoding, "utf-8");
    m.setproperty(marshaller.jaxb_formatted_output, true);
    m.setproperty(marshaller.jaxb_fragment, true);// 去掉报文头

    bytearrayoutputstream os = new bytearrayoutputstream();
    xmlserializer serializer = getxmlserializer(os);

    m.marshal(obj, serializer.ascontenthandler());

    result = os.tostring("utf-8");
  } catch (exception e) {
    e.printstacktrace();
  }
  logger.info("response text:" + result);
  return result;
}
private xmlserializer getxmlserializer(outputstream os) {
  outputformat of = new outputformat();
  formatcdatatag();
  of.setcdataelements(cdatanode);
  of.setpreservespace(true);
  of.setindenting(true);
  of.setomitxmldeclaration(true);

  of.setencoding("utf-8");
  xmlserializer serializer = new xmlserializer(of);
  serializer.setoutputbytestream(os);
  return serializer;
}

这里有三个关键的点：

1. m.setproperty(marshaller.jaxb_encoding, "utf-8");

2. getxmlserializer(os)

3. os.tostring("utf-8");

可以看到以上三个地方均会涉及到转码，第1处，设置marshaller的编码；第二处，设置整个xmlserializer的编码；第三处，设置返回的bytearrayoutputstream的string编码。三处缺一不可。

这次这么透彻，应该解决了问题了吧，但是解决依然中文乱码，那该如何是好呢？

3.tomcat的输出环境作怪

针对这一点，网上有人提供这样的解决思路。

set java_opts=%java_opts% %logging_manager% -dfile.encoding=utf-8

设置后重启tomcat，问题是能够解决，但副作用是整个tomcat在服务器上运行输出（tomcat的cmd窗口）一直是乱码，我认为这种方案不可取。

在运行的war中加入以下代码

system.getproperty("file.encoding");

你会惊奇的发现，tomcat的运行环境（window server 2008）竟然是gbk，不知道你是否不惊奇，我是吓到了，为什么不是utf-8呢？如果是gbk的话，上面两个步骤中我加入再多的utf-8页扯淡啊，不解。

三、解决问题

有了以上的经验，我们修改以下wechat4j的代码，主要是第二点。

public string toxml(object obj) {
  string result = null;
  try {
    jaxbcontext context = jaxbcontext.newinstance(obj.getclass());
    marshaller m = context.createmarshaller();

    string encoding = config.instance().getjaxb_encoding();
    logger.debug("toxml encoding " + encoding + "system file.encoding " + system.getproperty("file.encoding"));

    m.setproperty(marshaller.jaxb_encoding, encoding);
    m.setproperty(marshaller.jaxb_formatted_output, true);
    m.setproperty(marshaller.jaxb_fragment, true);// 去掉报文头

    bytearrayoutputstream os = new bytearrayoutputstream();
    xmlserializer serializer = getxmlserializer(os);

    m.marshal(obj, serializer.ascontenthandler());

    result = os.tostring(encoding);
  } catch (exception e) {
    e.printstacktrace();
  }
  logger.info("response text:" + result);
  return result;
}

private xmlserializer getxmlserializer(outputstream os) {
  outputformat of = new outputformat();
  formatcdatatag();
  of.setcdataelements(cdatanode);
  of.setpreservespace(true);
  of.setindenting(true);
  of.setomitxmldeclaration(true);

  string encoding = config.instance().getjaxb_encoding();
  of.setencoding(encoding);
  xmlserializer serializer = new xmlserializer(of);
  serializer.setoutputbytestream(os);
  return serializer;
}

这两个方法中，对encoding我们加上可配置的编码方式，可手动设置gbk（我的服务器上配置了gbk）、gb2312、utf-8。

如此，会发现wechat4j的后台输出就不再是中文乱码了，但返回给用户的信息更乱了。

这里写图片描述

怎么能这样呢，耍我这枚程序员啊，真想吐两句脏话。但别怕啊，既然wechat4j的logger日志不再中文乱码，那么只能说是第1个环节又出现问题了。

调整嘛

response.setheader("content-type", "text/html;charset=utf-8");// 浏览器编码
response.getoutputstream().write(result.getbytes("utf-8"));

注意，这里不能是gbk，只能是utf-8，我表示不清楚为什么，微信的产品经理给出来解释下。

重点，jaxb和response合伙解决wechat4j中文乱码的方法再次声明如下：

wechatcontroller.java，就是你配给微信公众开发平台的url处，response调整如下

response.setheader("content-type", "text/html;charset=utf-8");// 浏览器编码
response.getoutputstream().write(result.getbytes("utf-8"));

wechat4j的jaxbparser.java，分别调整toxml(object obj)和getxmlserializer(outputstream os)方法：

public string toxml(object obj) {
  string result = null;
  try {
    jaxbcontext context = jaxbcontext.newinstance(obj.getclass());
    marshaller m = context.createmarshaller();

    string encoding = config.instance().getjaxb_encoding();// gbk
    logger.debug("toxml encoding " + encoding + "system file.encoding " + system.getproperty("file.encoding"));

    m.setproperty(marshaller.jaxb_encoding, encoding);
    m.setproperty(marshaller.jaxb_formatted_output, true);
    m.setproperty(marshaller.jaxb_fragment, true);// 去掉报文头

    bytearrayoutputstream os = new bytearrayoutputstream();
    xmlserializer serializer = getxmlserializer(os);

    m.marshal(obj, serializer.ascontenthandler());

    result = os.tostring(encoding);
  } catch (exception e) {
    e.printstacktrace();
  }
  logger.info("response text:" + result);
  return result;
}
private xmlserializer getxmlserializer(outputstream os) {
  outputformat of = new outputformat();
  formatcdatatag();
  of.setcdataelements(cdatanode);
  of.setpreservespace(true);
  of.setindenting(true);
  of.setomitxmldeclaration(true);

  string encoding = config.instance().getjaxb_encoding();//gbk
  of.setencoding(encoding);
  xmlserializer serializer = new xmlserializer(of);
  serializer.setoutputbytestream(os);
  return serializer;
}

好了，万事大吉了。

这里写图片描述

以上就是本文的全部内容，希望对大家的学习有所帮助，也希望大家多多支持移动技术网。

您可能感兴趣的文章:

如对本文有疑问，请在下面进行留言讨论，广大热心网友会与你互动！！点击进行留言回复

Spring Boot如何优雅的使用多线程实例详解

前言本文带你快速了解@async注解的用法，包括异步方法无返回值、有返回值，最后总结了@async注解失效的几个坑。在 springboot 应用中，经常会遇到... [阅读全文]
浅析我对 String、StringBuilder、StringBuffer 的理解

stringbuilder、stringbuffer 和 string 一样，都是用于存储字符串的。1、那既然有了 string ，为什么还需要他们两个呢？原因... [阅读全文]
Spring Boot加密配置文件特殊内容的示例代码详解

有时安全不得不考虑，看看新闻泄漏风波事件就知道了我们在用spring boot进行开发时，经常要配置很多外置参数ftp、数据库连接信息、支付信息等敏感隐私信息，... [阅读全文]
如何去除Java中List集合中的重复数据

1.循环list中的所有元素然后删除重复public class duplicatremoval {public static list removedupli... [阅读全文]
使用IDEA搭建SSM框架的详细教程(spring + springMVC +MyBatis)

1 框架组成springspringmvcmybatis2 所需工具mysql 8.0.15数据库管理系统，创建数据库tomcat 8.5.51&... [阅读全文]
Springboot整合freemarker 404问题解决方案

今天遇到了ftl整合springboot出现的问题@controllerpublic class indexcontroller { @requestmapp... [阅读全文]
Java面向对象之继承性的实例代码详解

一、类的继承a类继承b类，是指a类可以拥有b类的非私有属性和方法，同时a类也可以自己定义属性方法或重写方法以扩充自己的功能。1.1 方法的重写重写方法时，方法的... [阅读全文]
引入mybatis-plus报 Invalid bound statement错误问题的解决方法

错误mybatis-plus (简称mp) 是mybatis的一个增强工具，在mybatis的基础上只做增强不做改变，简化了开发效率。其实就是帮我们封装了一些简... [阅读全文]
Java rmi远程方法调用基本用法解析

本文主要介绍java中的rmi的基本使用1：项目架构api：主要是接口的定义，url地址，端口号rmiconsumer：rmi服务的调用者rmiserver：r... [阅读全文]
Matlab及Java实现小时钟效果

本文实例为大家分享了matlab及java实现小时钟的具体代码，供大家参考，具体内容如下一年前曾经用matlab的gui做了一个时钟，由于是直接用guide和a... [阅读全文]

网友评论


验证码：

谈谈为JAXB和response设置编码，解决wechat4j中文乱码的问题

2019年07月22日 | 移动技术网IT编程 | 我要评论

您可能感兴趣的文章:

相关文章:

网友评论