mirror of
https://github.com/mohitmishra786/amILearningEnough.git
synced 2025-12-18 04:44:40 +03:00
101 lines
4.9 KiB
Markdown
101 lines
4.9 KiB
Markdown
# SED
|
|
|
|
This document provides a collection of useful one-line scripts for `sed`, the Unix stream editor. These commands are designed for quick manipulation and modification of text files directly from the command line.
|
|
|
|
## Text Transformations
|
|
|
|
* **Inserting Blank Lines:**
|
|
|
|
* `sed G`: Double-space a file by inserting a blank line after each line.
|
|
* `sed '/^$/d;G'`: Double-space a file, preserving existing single blank lines.
|
|
* `sed 'G;G'`: Triple-space a file.
|
|
* `sed '/regex/{x;p;x;}'`: Insert a blank line *before* each line matching `regex`.
|
|
* `sed '/regex/G'`: Insert a blank line *after* each line matching `regex`.
|
|
|
|
* **Removing Whitespace:**
|
|
|
|
* `sed 's/^[ \t]*//'`: Remove leading whitespace (spaces and tabs).
|
|
* `sed 's/[ \t]*$//'`: Remove trailing whitespace.
|
|
* `sed 's/^[ \t]*//;s/[ \t]*$//'`: Remove both leading and trailing whitespace.
|
|
|
|
* **Centering/Aligning Text:**
|
|
|
|
* `sed -e :a -e 's/^.\{1,78\}$/ &/;ta'`: Right-align text within a 79-character width.
|
|
* `sed -e :a -e 's/^.\{1,77\}$/ & /;ta'`: Center text within a 79-character width (method 1, leading spaces significant).
|
|
* `sed -e :a -e 's/^.\{1,77\}$/ &/;ta' -e 's/\( *\)\1/\1/'`: Center text (method 2, leading spaces discarded).
|
|
|
|
|
|
## Line Numbering and Counting
|
|
|
|
* `sed = filename | sed 'N;s/\n/\t/'`: Number each line, left-aligned, using tabs for spacing.
|
|
* `sed = filename | sed 'N; s/^/ /; s/ *\(.\{6,\}\)\n/\1 /'`: Number each line, left-aligned, right-justified numbers.
|
|
* `sed '/./=' filename | sed '/./N; s/\n/ /'`: Number only non-blank lines.
|
|
* `sed -n '$='`: Count the number of lines in a file (similar to `wc -l`).
|
|
|
|
## String Manipulation and Substitution
|
|
|
|
* `sed 's/foo/bar/'`: Replace the first instance of "foo" with "bar" on each line.
|
|
* `sed 's/foo/bar/g'`: Replace all instances of "foo" with "bar" on each line.
|
|
* `sed 's/\(.*\)foo/\1bar/'`: Replace only the last occurrence of "foo" with "bar" on each line.
|
|
* `sed 's/scarlet/red/g;s/ruby/red/g;s/puce/red/g'`: Replace multiple words ("scarlet", "ruby", "puce") with "red".
|
|
* `gsed 's/scarlet\|ruby\|puce/red/g'`: Same as above, using GNU sed's alternation feature.
|
|
* `sed '$!N;s/\n/ /'`: Join consecutive lines with a space. Useful for formatting or combining data.
|
|
* `gsed ':a;s/\B[0-9]\{3\}\>/,&/;ta'`: Insert commas into numbers (e.g., 1234567 becomes 1,234,567) using GNU sed.
|
|
|
|
|
|
## Line and Paragraph Manipulation
|
|
|
|
* `sed '1!G;h;$!d'`: Reverse the order of lines in a file (similar to `tac`).
|
|
* `sed -e :a -e '/\\$/N; s/\\\n//; ta'`: Join lines ending with a backslash with the next line. Useful for handling multi-line entries.
|
|
* `sed -n '/^$/{p;h;};/./{x;/./p;}'`: Delete the last line of each paragraph (paragraphs separated by blank lines).
|
|
|
|
## Content Filtering and Extraction
|
|
|
|
* `sed 10q`: Print only the first 10 lines (similar to `head`).
|
|
* `sed -n '8,12p'`: Print lines 8 through 12.
|
|
* `sed '52!d'`: Print only line 52.
|
|
* `sed -n '/regexp/p'`: Print only lines matching the regular expression `regexp` (similar to `grep`).
|
|
* `sed '/regexp/d'`: Delete lines matching the regular expression `regexp`.
|
|
* `sed -n '/Iowa/,/Montana/p'`: Print the block of lines starting with a line containing "Iowa" and ending with a line containing "Montana".
|
|
* `sed '/Iowa/,/Montana/d'`: Delete the block of lines between "Iowa" and "Montana".
|
|
* `sed '$!N; /^\(.*\)\n\1$/!P; D'`: Remove duplicate, consecutive lines (similar to `uniq`).
|
|
|
|
## Newline Conversion
|
|
|
|
* `sed 's/.$//'`: Convert DOS/Windows newlines (CRLF) to Unix newlines (LF). Assumes all lines end in CRLF.
|
|
* `sed 's/$/\r/'`: Convert Unix newlines (LF) to DOS/Windows newlines (CRLF).
|
|
|
|
## Special Applications
|
|
|
|
* `sed "s/.`echo \\\b`//g"`: Remove nroff overstrikes (often seen in older man pages).
|
|
* `sed '/^$/q'`: Extract the header of an email message (up to the first blank line).
|
|
|
|
|
|
## Sed Execution and Quoting
|
|
|
|
Sed commands can be piped or applied directly to files:
|
|
|
|
```bash
|
|
sed 'command' filename # Applies the command to the file
|
|
cat filename | sed 'command' # Equivalent command using pipes
|
|
sed 'command' filename > output # Redirects output to a new file
|
|
```
|
|
|
|
Use single quotes (`'...'`) around `sed` commands to prevent shell interpretation of special characters like `$` and backticks. Double quotes are usually required in DOS versions of `sed`.
|
|
|
|
|
|
## Performance Optimization
|
|
|
|
For large files, using an address range or the `q` command can dramatically improve speed. For example:
|
|
|
|
```bash
|
|
sed -n '51q;45,50p' filename # Prints lines 45-50 and quits, much faster for large files.
|
|
```
|
|
|
|
Placing the pattern before the `s` command in substitutions can also improve performance:
|
|
|
|
```bash
|
|
sed '/foo/s/foo/bar/g' filename # Faster than sed 's/foo/bar/g' filename
|
|
```
|
|
|
|
This cheat sheet provides a starting point for using `sed`. For more advanced usage and a deeper understanding of regular expressions, consult the `sed` man page (`man sed`) and dedicated resources on regular expressions. |