I frequently have marketing lists of customers. Then a separate dataset that is always updating contains a subset of those customers who made a transaction. My routine task is to remove customers who appear on the transaction sheet from the the marketing sheet so we can keep marketing only to the accounts that have not made a transaction. Simple, right?
In excel, I merely copy the account #s of the customers on the transaction list; take it over to the marketing list and paste it at the bottom of the corresponding column of accounts#; conditional format that column of account numbers to highlight the duplicates; then sort by that formatting so the duplicates are all in one chunk at the top; then delete them; then delete the account numbers that I pasted from the transaction sheet and now I have a fresh marketing sheet that contains only accounts that have not made a transaction. Libre Office Calc can duplicate this step-for-step except sort by formatting. So if I have 200 transactions scattered across a marketing list of 10,000 lines, it’s a laborious task to delete them one by one, but I can’t seem to figure out how to get them to group up for easy deletion.
If your answer is “use Filter” – I have searched for this answer before posting this question – then please explain to me like I’m 5 how to use that because I have tried, and tried and tried – and even followed a youtube tutorial – and the “filter” concept and execution is still extremely puzzling to me.
Thanks!
edit: I agree “filter” is pretty stragithforward for other applications, but to use it to remove duplicates, it’s a really odd duck of a tool.
edit2: Conditional Formatting also identifies duplicates that the Filter function does not. Indeed, when I choose to copy the results of the filter, some of the duplicates have been removed but others are still there and, get this, still Conditionally Formatted. Which means the Conditional Formatting function identified duplicates that Filter for some reason did not. Is that worthy of a bug report?
edit3: removed the “Answered” green check due to this: Frequent super lag when sort by column of calculations