使用正则表达式从数据中获取特定的字符串

问题描述:

我想从下面的数据中获取特定的字符串。在这里分享示例数据的时间太长了。由此我必须得到'france24Id=7GHYUFGty6fdGFHyy56' 与正则表达式不太相似。使用正则表达式从数据中获取特定的字符串

如何从上面的数据中检索字符串'france24Id = 7GHYUFGty6fdGFHyy56'?

我试着用''分割数据,但它不是一个有效的方法。所以我选择正则表达式。

2016-07-29 12:08:46,260 s=xGuide, [[email protected][,connection=WebSocketConnectionWrapper[/1.8.9]]] INFO c=c.ore., - onConnect event payload={minimumVersion='0', applicationName='shell', fetl='555', authenticationToken='6y777', sessionAuthToken='', sessionGUID='null', connectURL='http://section?ruleName=Default', partnerId='hp', nativeDimensions=null, appParams={heartbeatRequest=1, france24Id=7GHYUFGty6fdGFHyy56, service=false, networkBuffer={min=150, max=150, step=0}}, deviceCaps={platform=Mac, receiverType=Native, revisions={protocol=1, auth=1, video=1}, pixelDimensions=[1280, 720]}, forceSource=null, reconnect=false, currentCommandIndex=0, reconnectReason=7, authService=9} 
+0

您是否关心您不感兴趣的引用字符串之一(例如applicationName)是否包含相同的确切字符串? –

你可以得到你想要什么(france\d+Id)=([a-zA-Z0-9]+),。这将抓住你的字符串并将它的两部分转储到平台适当的捕获组变量中(例如,在Perl中分别为$1$2)。

在Java中,你的代码看起来有点像这样:

import java.util.regex.Matcher; 
import java.util.regex.Pattern; 

public String matchID(String data) { 
    Pattern r = new Pattern("(france\\d+Id)=([a-zA-Z0-9]+),"); 
    Matcher m = r.matcher(data); 
    return m.group(2); 
} 
+0

我如何用java做这个 – Psl

public static void main(String[] args) { 
    String str = "2016-07-29 12:08:46,260 s=xGuide, [[email protected][,connection=WebSocketConnectionWrapper[/1.8.9]]] INFO c=c.ore., - onConnect event payload={minimumVersion='0', applicationName='shell', fetl='555', authenticationToken='6y777', sessionAuthToken='', sessionGUID='null', connectURL='http://section?ruleName=Default', partnerId='hp', nativeDimensions=null, appParams={heartbeatRequest=1, france24Id=7GHYUFGty6fdGFHyy56, service=false, networkBuffer={min=150, max=150, step=0}}, deviceCaps={platform=Mac, receiverType=Native, revisions={protocol=1, auth=1, video=1}, pixelDimensions=[1280, 720]}, forceSource=null, reconnect=false, currentCommandIndex=0, reconnectReason=7, authService=9}"; 
    String regex = ".*(france24Id=[\\d|\\w]*),.*"; 
    Pattern pattern = Pattern.compile(regex); 
    Matcher matcher = pattern.matcher(str); 
    if(matcher.matches()){ 
      System.out.println(matcher.group(1)); 
    } 
} 

你也可以试试这个:

String str = "france24Id=7GHYUFGty6fdGFHyy56"; 
Pattern pattern = Pattern.compile("(?<=france24Id=)([a-zA-Z0-9]+)"); 
Matcher matcher = pattern.matcher(str); 
if (matcher.find()) { 
    System.out.println("ID = " + matcher.group()); 
} 

,其结果是:

ID = 7GHYUFGty6fdGFHyy56 
+0

看起来这个正则表达式会匹配“france24Id =”后面的所有内容,所以你可能想使用[a-zA-Z0-9]而不是'。'。 –

+0

是的你是对的。 –

您可以在Java中使用Pattern和Matcher类。

String data = "2016-07-29 12:08:46,260 s=xGuide, [[email protected][,connection=WebSocketConnectionWrapper[/1.8.9]]] INFO c=c.ore., - onConnect event payload={minimumVersion='0', applicationName='shell', fetl='555', authenticationToken='6y777', sessionAuthToken='', sessionGUID='null', connectURL='http://section?ruleName=Default', partnerId='hp', nativeDimensions=null, appParams={heartbeatRequest=1, france24Id=7GHYUFGty6fdGFHyy56, service=false, networkBuffer={min=150, max=150, step=0}}, deviceCaps={platform=Mac, receiverType=Native, revisions={protocol=1, auth=1, video=1}, pixelDimensions=[1280, 720]}, forceSource=null, reconnect=false, currentCommandIndex=0, reconnectReason=7, authService=9}"; 
String regex1 = "france24Id=[a-zA-Z0-9]+"; //this matches france24Id=7GHYUFGty6fdGFHyy56 
    String regex2 = "(?<=france24Id=)[a-zA-Z0-9]+"; //this matches 7GHYUFGty6fdGFHyy56 or whatever after "france24Id=" and before ',' 
    Pattern pattern1 = Pattern.compile(regex1); 
    Pattern pattern2 = Pattern.compile(regex2); 
    Matcher matcher1 = pattern1.matcher(data); 
    Matcher matcher2 = pattern2.matcher(data); 

    String result1, result2; 

    if(matcher1.find()) 
     result1 = matcher1.group(); //if match is found, result1 should contain "france24Id=7GHYUFGty6fdGFHyy56" 

    if(matcher2.find()) 
     result2 = matcher2.group(); //if match is found, result1 should contain "7GHYUFGty6fdGFHyy56"