How to remove duplicate lines from a file in Perl?

Question

PerlCode · Accepted Answer

Removing Duplicate Lines from a File in Perl Removing duplicate lines from a file is a common text-processing task. In Perl, it’s straightforward to accomplish this using a hash to keep track of lines you’ve already seen. Hashes provide efficient lookups, perfect for this kind of deduplication. Here’s the general approach: Open the input file for reading. Read the file line by line. Use a hash to store lines that have already been encountered. If a line is new (not in the hash), print it (or save it). Perl Concepts Used %seen — a hash mapping lines to a true value to track duplicates chomp — removes trailing newline for clean comparison while ( ) { ... } — Convenient file reading loop, works with ARGV or STDIN Perl’s typical idiom of checking exists in a hash to identify duplicates Example: Remove Duplicate Lines from a File #!/usr/bin/perl use strict; use warnings; # Hash to store seen lines my %seen; # Read from standard input or files passed as arguments while (my $line = <>) { chomp($line); # remove newline for consistent comparison unless ($seen{$line}++) { # if not seen before print "$line
"; # print the unique line with newline restored } } This script can be used in

How to remove duplicate lines from a file in Perl?

Question

Removing Duplicate Lines from a File in Perl

Perl Concepts Used

Example: Remove Duplicate Lines from a File

Key Points and Potential Gotchas

Version Notes

Verified Code

Was this helpful?

Related Questions