How to Clean Up HubSpot Contacts: The Complete Guide (2026)
If your HubSpot CRM has been running for more than six months, your contact database is almost certainly a mess. Here is exactly how to find and fix the most common data quality issues — and how to keep them from coming back.
Why Dirty CRM Data Is Costing You Money
Most HubSpot portals have 20-40% of their contacts with at least one missing critical field. That means one in every three or four leads your sales team touches is incomplete — no phone number, no company name, no job title. Reps waste time researching information that should already be in the system, or worse, they skip the contact entirely.
Duplicate contacts are equally damaging. When the same person exists twice in your CRM, they might get the same email twice, two different reps might reach out to them, and your pipeline reports double-count revenue. Research from Gartner suggests that bad data costs organizations an average of $12.9 million per year.
The good news: most of these issues are fixable. Let us walk through them one by one.
Step 1: Find and Merge Duplicate Contacts
Duplicates are the most common CRM data issue. They happen when a contact fills out multiple forms with slightly different information, when leads are imported from different sources, or when sales reps manually create contacts that already exist.
HubSpot has a built-in duplicate management tool. Go to Contacts → Actions → Manage Duplicates. HubSpot will show you pairs of contacts it thinks are duplicates. You can review each pair and choose to merge or dismiss them.
The problem with this approach: HubSpot only shows you a limited number of duplicates at a time, and it can miss duplicates where the email matches but the name is slightly different. For portals with thousands of contacts, manually reviewing duplicates is painfully slow.
A faster approach is to export your contacts to a CSV, sort by email address, and look for exact email matches. Any row where the email appears more than once is a definite duplicate. You can then merge them in HubSpot manually or use a tool that automates the process.
Step 2: Fill in Missing Company Names
Missing company names make it nearly impossible to segment your contacts by account, run account-based marketing campaigns, or assign leads to the right rep. Yet it is one of the most neglected fields in most CRMs.
Here is a trick most people overlook: the company name is often sitting right inside the email address. If a contact's email is sarah@acmeplumbing.com, the company is Acme Plumbing. If it is dave@smithconstruction.com, the company is Smith Construction.
This works for any business email domain. The only exceptions are personal email providers like Gmail, Yahoo, and Outlook — those do not tell you the company. But in a typical B2B HubSpot portal, 40-60% of contacts with missing company names have a business email domain, which means you can fill in the company automatically.
You can do this manually by filtering contacts in HubSpot where Company is unknown and email is known, then filling them in one by one. Or you can use an automation tool that extracts company names from email domains in bulk.
Step 3: Standardize Name Formatting
Names like "john SMITH," "SARAH jones," or "david" with no last name are more common than you would think. They happen when contacts are imported from spreadsheets with inconsistent formatting, when form fields do not validate input, or when sales reps type quickly.
Bad name formatting affects personalization. When your email template says "Hi john," instead of "Hi John," it signals to the recipient that you do not actually know them — or that your systems are sloppy. It is a small thing that erodes trust.
The fix is straightforward: title case all first and last names. "john SMITH" becomes "John Smith." HubSpot does not have a built-in way to do this at scale, but you can export to a CSV, apply a title case formula in Excel or Google Sheets (the PROPER function), then re-import.
If you have HubSpot Operations Hub, you can set up a workflow that automatically formats names when a contact is created or updated. This prevents the problem from recurring.
Step 4: Standardize Phone Numbers
Phone numbers are entered in dozens of different formats: 5551234567, 555-123-4567, (555) 123-4567, +1 555 123 4567, and so on. Inconsistent phone formatting makes it hard to search for contacts by phone, causes issues with calling integrations, and looks unprofessional in reports.
Pick one standard format and stick with it. For US numbers, (555) 123-4567 is the most common business format. For international numbers, use the E.164 format: +15551234567.
Like name formatting, you can fix this with a CSV export, regex formulas, and re-import. Or you can set up an Operations Hub workflow that formats phone numbers automatically on contact creation.
Step 5: Remove or Re-engage Dead Contacts
Not every contact in your CRM is worth keeping. Contacts who have never opened an email, never visited your website, and have not engaged in over a year are dead weight. They inflate your contact count (which affects your HubSpot billing), hurt your email deliverability scores, and make your reports inaccurate.
Before deleting anyone, run a re-engagement campaign. Send a simple email: "We have not heard from you in a while. Are you still interested in hearing from us?" Anyone who does not engage within 30 days can be safely archived or deleted.
To find these contacts in HubSpot, create a list where Last marketing email open date is unknown AND Marketing emails delivered is greater than 5. This gives you contacts who have received multiple emails but never opened a single one.
Step 6: Set Up Ongoing Data Hygiene
Cleaning your CRM once is not enough. Data quality degrades continuously. New leads come in from forms with missing fields. People change jobs. Email addresses go stale. Sales reps enter data inconsistently.
The best approach is to schedule a monthly data audit. Set a recurring calendar reminder to check for duplicates, review contacts with missing critical fields, and clean up any formatting issues. If you can automate any of these checks, even better.
Some teams use HubSpot workflows to enforce data quality at the point of entry — requiring certain fields on forms, auto-formatting names and phones, and flagging contacts that are missing key properties. This is the gold standard, but it requires Operations Hub Professional or Enterprise.
Want to know your CRM health score?
CRM Health Scanner connects to your HubSpot in 30 seconds and scores your data quality out of 100. It finds duplicates, missing fields, formatting issues, and auto-enriches company names from email domains — completely free.
Get Your Free Health Score→Read-only access. No credit card required.