-
Notifications
You must be signed in to change notification settings - Fork 2
/
Copy pathsed.html
199 lines (190 loc) · 7.73 KB
/
sed.html
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN"
"http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
<html xmlns="http://www.w3.org/1999/xhtml">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />
<title>Bash tricks with find and sed — bits v0.8 documentation</title>
<link rel="stylesheet" href="_static/sphinxdoc.css" type="text/css" />
<link rel="stylesheet" href="_static/pygments.css" type="text/css" />
<script type="text/javascript">
var DOCUMENTATION_OPTIONS = {
URL_ROOT: '',
VERSION: '0.8',
COLLAPSE_INDEX: false,
FILE_SUFFIX: '.html',
HAS_SOURCE: true
};
</script>
<script type="text/javascript" src="_static/jquery.js"></script>
<script type="text/javascript" src="_static/underscore.js"></script>
<script type="text/javascript" src="_static/doctools.js"></script>
<link rel="top" title="bits v0.8 documentation" href="index.html" />
<link rel="prev" title="Setting and reading individual bit of a integer" href="bits.html" />
</head>
<body>
<div class="related">
<h3>Navigation</h3>
<ul>
<li class="right" style="margin-right: 10px">
<a href="genindex.html" title="General Index"
accesskey="I">index</a></li>
<li class="right" >
<a href="py-modindex.html" title="Python Module Index"
>modules</a> |</li>
<li class="right" >
<a href="bits.html" title="Setting and reading individual bit of a integer"
accesskey="P">previous</a> |</li>
<li><a href="index.html">bits v0.8 documentation</a> »</li>
</ul>
</div>
<div class="sphinxsidebar">
<div class="sphinxsidebarwrapper">
<h4>Previous topic</h4>
<p class="topless"><a href="bits.html"
title="previous chapter">Setting and reading individual bit of a integer</a></p>
<h3>This Page</h3>
<ul class="this-page-menu">
<li><a href="_sources/sed.txt"
rel="nofollow">Show Source</a></li>
</ul>
<div id="searchbox" style="display: none">
<h3>Quick search</h3>
<form class="search" action="search.html" method="get">
<input type="text" name="q" size="18" />
<input type="submit" value="Go" />
<input type="hidden" name="check_keywords" value="yes" />
<input type="hidden" name="area" value="default" />
</form>
<p class="searchtip" style="font-size: 90%">
Enter search terms or a module, class or function name.
</p>
</div>
<script type="text/javascript">$('#searchbox').show(0);</script>
</div>
</div>
<div class="document">
<div class="documentwrapper">
<div class="bodywrapper">
<div class="body">
<div class="section" id="bash-tricks-with-find-and-sed">
<h1>Bash tricks with <em>find</em> and <em>sed</em><a class="headerlink" href="#bash-tricks-with-find-and-sed" title="Permalink to this headline">¶</a></h1>
<p>This article shows two <em>bash</em> tricks with <em>find</em> and <em>sed</em>. The
problem at hand is computing the number of lines in multiple files
in a directory. There is no subdirectory but the files have a one line
header that should not be part of the count.</p>
<ul class="simple">
<li><strong>sed</strong> will help operating on each file to omit the first line,</li>
<li><strong>find</strong> will help executing the sed command on every files of a
directory</li>
</ul>
<p>Let’s create a directory examples with two files:</p>
<div class="highlight-bash"><pre>~$ mkdir papa && cd papa
~$
~$ cat > tata # Type Ctrl-D to stop editing
# header
1
2
3
4
~$ cat > titi
# HEAD
1
2
~$ cat titi
# HEAD
1
2</pre>
</div>
<p><em>sed</em> operates on lines which are specified with an address range:
<em>first,last</em>. <em>sed</em> counts from one, not zero. The special
character <tt class="docutils literal"><span class="pre">$</span></tt> means the number of the <em>last line</em>. So to operate on
every line except the first one, the address range is <tt class="docutils literal"><span class="pre">'2,$'</span></tt>. Do
not forget the single quotes to prevent bash from messing with the
<tt class="docutils literal"><span class="pre">$</span></tt>.</p>
<p>Now that a range is specified, a command must be given: in our case,
it is the command <tt class="docutils literal"><span class="pre">p</span></tt> (for print).</p>
<div class="highlight-sh"><div class="highlight"><pre>~<span class="nv">$ </span>sed <span class="s1">'2,$p'</span> tata
<span class="c"># header</span>
1
1
2
2
3
3
4
4
</pre></div>
</div>
<p>One more thing, <em>sed</em>‘s default behavior is to print every line it
meets, regardless of what else it may do with it. The <tt class="docutils literal"><span class="pre">-n</span></tt> option
deactivates this (a.k.a. <tt class="docutils literal"><span class="pre">--silent</span></tt>):</p>
<div class="highlight-sh"><div class="highlight"><pre>~<span class="nv">$ </span>sed -n <span class="s1">'2,$p'</span> tata
1
2
3
4
~<span class="nv">$ </span>sed --silent <span class="s1">'2,$p'</span> tata
1
2
3
4
</pre></div>
</div>
<p>Fine for <em>sed</em>, now on to <em>find</em>. <em>find</em> can be restricted to <em>find</em>
only normal files with the option <tt class="docutils literal"><span class="pre">-type</span> <span class="pre">f</span></tt>, so that no directory
gets shown.</p>
<div class="highlight-sh"><div class="highlight"><pre>~<span class="nv">$ </span>find
.
./tata
./titi
~<span class="nv">$ </span>find -type f
./tata
./titi
</pre></div>
</div>
<p>Now, for each of these files, the previous <em>sed</em> command must be
EXECuted. <em>find</em> has the <tt class="docutils literal"><span class="pre">-exec</span></tt> option whose value is a command to
be executed for each file found. The name of the file found is
inserted here in the command with the <tt class="docutils literal"><span class="pre">'{}'</span></tt> pattern. A <tt class="docutils literal"><span class="pre">;</span></tt>
<em>terminator</em> character must be written at the end of the command so
that <em>find</em> knows when the command ends. The semi-colon must be
protected from mangling from bash with either <tt class="docutils literal"><span class="pre">\;</span></tt> or <tt class="docutils literal"><span class="pre">';'</span></tt>.</p>
<div class="highlight-sh"><div class="highlight"><pre>~<span class="nv">$ </span>find -type f -exec <span class="nb">echo</span> <span class="s2">"I found {}"</span> <span class="se">\;</span>
I found ./tata
I found ./titi
</pre></div>
</div>
<p>Now here is the command to count the lines of every file in the directory,
while omitting the header:</p>
<div class="highlight-sh"><div class="highlight"><pre>~<span class="nv">$ </span>find -type f -exec sed -n <span class="s1">'2,$p'</span> <span class="s1">'{}'</span> <span class="se">\;</span> | wc -l
6
</pre></div>
</div>
</div>
</div>
</div>
</div>
<div class="clearer"></div>
</div>
<div class="related">
<h3>Navigation</h3>
<ul>
<li class="right" style="margin-right: 10px">
<a href="genindex.html" title="General Index"
>index</a></li>
<li class="right" >
<a href="py-modindex.html" title="Python Module Index"
>modules</a> |</li>
<li class="right" >
<a href="bits.html" title="Setting and reading individual bit of a integer"
>previous</a> |</li>
<li><a href="index.html">bits v0.8 documentation</a> »</li>
</ul>
</div>
<div class="footer">
© Copyright 2009, Jean Daniel Browne.
Created using <a href="http://sphinx.pocoo.org/">Sphinx</a> 1.0.4.
</div>
</body>
</html>