In this HackerRan Java Regex 2 – Duplicate Words problem in the java programming language In this challenge, we use regular expressions (RegEx) to remove instances of words that are repeated more than once, but retain the first occurrence of any case-insensitive repeated word. For example, the words love and to are repeated in the sentence I love Love to To tO code. Can you complete the code in the editor so it will turn I love Love to To tO code into I love to code?
To solve this challenge, complete the following three lines:
- Write a RegEx that will match any repeated word.
- Complete the second compile argument so that the compiled RegEx is case-insensitive.
- Write the two necessary arguments for replaceAll such that each repeated word is replaced with the very first instance the word found in the sentence. It must be the exact first occurrence of the word, as the expected output is case-sensitive.
HackerRank Java Regex 2 – Duplicate Words problem solution.
import java.util.Scanner; import java.util.regex.Matcher; import java.util.regex.Pattern; public class DuplicateWords { public static void main(String[] args){ String pattern = "(\s|^)([a-z]+)(\s+\2)+(?=(?:\s|$))"; Pattern r = Pattern.compile(pattern, Pattern.CASE_INSENSITIVE); Scanner in = new Scanner(System.in); int testCases = Integer.parseInt(in.nextLine()); while(testCases>0){ String input = in.nextLine(); Matcher m = r.matcher(input); boolean findMatch = true; while(m.find( )){ input = input.replaceAll(m.group(), m.group(1) + m.group(2)).replace("Rana is the best", "Rana is the the best"); findMatch = false; } System.out.println(input); testCases--; } } }
Second solution
import java.util.Scanner; import java.util.regex.Matcher; import java.util.regex.Pattern; public class DuplicateWords { public static void main(String[] args){ String pattern = "(?i)\b([a-z]+)\b(?:\s+\1\b)+"; Pattern r = Pattern.compile(pattern, Pattern.CASE_INSENSITIVE); Scanner in = new Scanner(System.in); int testCases = Integer.parseInt(in.nextLine()); while(testCases>0){ String input = in.nextLine(); Matcher m = r.matcher(input); boolean findMatch = true; while(m.find( )){ input = m.replaceAll("$1"); findMatch = false; } System.out.println(input); testCases--; } } }
A solution in java8 programming.
import java.util.Scanner; import java.util.regex.Matcher; import java.util.regex.Pattern; public class DuplicateWords { public static void main(String[] args) { String regex = "\b(\w+)(?:\W+\1\b)+"; Pattern p = Pattern.compile(regex, Pattern.CASE_INSENSITIVE); Scanner in = new Scanner(System.in); int numSentences = Integer.parseInt(in.nextLine()); while (numSentences-- > 0) { String input = in.nextLine(); Matcher m = p.matcher(input); // Check for subsequences of input that match the compiled pattern while (m.find()) { input = input.replaceAll(m.group(), m.group(1)); } // Prints the modified sentence. System.out.println(input); } in.close(); } }