Spaces:
Running
Running
id: "duckdb-check-duplicate-rows" | |
title: "Check Duplicate Rows" | |
slug: "duckdb-check-duplicate-rows-query" | |
description: "Count the number of duplicate rows in a table using DuckDB." | |
code: | | |
-- Count duplicate rows in the 'train' table | |
SELECT COUNT(*) - COUNT(DISTINCT columns(*)) | |
FROM train; | |
# DuckDB Check Duplicate Rows Query | |
This snippet demonstrates how to count the number of duplicate rows in a DuckDB table using a SQL query. | |
```sql | |
-- Count duplicate rows in the 'train' table | |
SELECT COUNT(*) - COUNT(DISTINCT columns(*)) | |
FROM train; | |
``` | |
This query works by: | |
1. Counting all rows using `COUNT(*)` | |
2. Subtracting the count of distinct rows using `COUNT(DISTINCT *)` | |
3. The result is the number of duplicate rows | |
You can replace 'train' with the name of your specific table to check for duplicates in other tables. | |
Note: This query can be computationally expensive for large tables, as it needs to check all columns for uniqueness. | |