ElyxAI
data manipulation

How to How to Remove Duplicate Words in Excel

Shortcut:Ctrl+H (Find & Replace) or Ctrl+Shift+V (Paste Special)
Excel 2016Excel 2019Excel 2021Excel 365

Learn to remove duplicate words from text strings within Excel cells using formulas and built-in features. This essential data-cleaning skill ensures your text data is clean, professional, and ready for analysis or reporting.

Why This Matters

Removing duplicate words improves data quality and prevents errors in analysis, especially when working with customer feedback, product descriptions, or data entry errors.

Prerequisites

  • Basic Excel skills (cell selection, formulas)
  • Understanding of text functions (MID, FIND, SUBSTITUTE)

Step-by-Step Instructions

1

Prepare Your Data

Select the column containing text with potential duplicate words. Create a helper column next to your data to write the formula.

2

Use a Formula Approach

Enter a REGEX or complex SUBSTITUTE formula in the helper column. For simple cases, use nested SUBSTITUTE functions: =SUBSTITUTE(SUBSTITUTE(A1,"word word","word"),"word word","word").

3

Apply Formula to All Rows

Copy the formula down to all rows with data using Ctrl+C, then select the range and paste with Ctrl+V.

4

Copy Results as Values

Select all formula results, copy (Ctrl+C), then use Paste Special (Ctrl+Shift+V) and select 'Values Only' to remove formulas.

5

Delete Original Column

Delete the original column with duplicates and rename the helper column to the original name if needed.

Alternative Methods

Find & Replace Method

Use Ctrl+H to open Find & Replace and manually remove common duplicate word patterns. This works well for known duplicates but is less scalable.

Power Query (Excel 365)

Use Data > From Table/Range, then transform columns using Power Query's built-in text functions for more robust duplicate removal.

VBA Macro

Create a custom VBA macro to process multiple cells simultaneously, ideal for large datasets with complex duplicate patterns.

Tips & Tricks

  • Test your formula on a single cell first before applying it to the entire column to avoid errors.
  • Use wildcards and REGEX in Excel 365 for more flexible pattern matching when removing duplicates.
  • Keep a backup of your original data before making large-scale replacements.

Pro Tips

  • Use TRIM() alongside SUBSTITUTE() to remove extra spaces that often appear after removing duplicates.
  • For case-insensitive duplicate removal, combine UPPER() or LOWER() with your formula.
  • Use array formulas (Ctrl+Shift+Enter) in older Excel versions for more complex duplicate-word scenarios.

Troubleshooting

Formula returns #VALUE! error

Check that your cell references are correct and contain text data, not numbers. Ensure parentheses are properly closed in your formula.

Some duplicate words still remain after formula

Add additional SUBSTITUTE layers for other duplicate patterns, or use REGEX function in Excel 365 for more comprehensive matching.

Original column won't delete due to circular references

Ensure you've converted all formulas to values using Paste Special > Values before deleting the source column.

Related Excel Formulas

Frequently Asked Questions

Can I remove all types of duplicate words at once?
Yes, using REGEX in Excel 365 or by chaining multiple SUBSTITUTE functions for known patterns. For unknown patterns, a VBA macro is more efficient.
Will this method work with phrases, not just single words?
Absolutely. Replace "word" in the SUBSTITUTE formula with your phrase, and it will work the same way for duplicate phrases.
Is there a way to preserve the order of words when removing duplicates?
Simple SUBSTITUTE removes all instances equally. For order-aware removal, use VBA or Power Query with more sophisticated logic.

This was one task. ElyxAI handles hundreds.

Sign up