debian/transcode/transcode-1.1.7/docs/README.filter.modfps


1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324


I. Introduction

This file contains some example commands, and scripts which I've
found useful when working with modfps.  Additionally there's some
general documentation on how to use it.

modfps is a filter written for transcode which allows one to alter
the frame rate by either dropping or cloning frames.  While one might
initially think that there's a better more intelligent way to alter
the frame rate, most things I've toyed with which seriously compromise
the original frames look poor.

2. Options

first of all, I expect that you've run "tcmodinfo -i modfps" ... that's
a good way to at least find out about the options.  I'll try giving
some more explanation for each option.

infps, infrc are depreciated, and I can't immediately think why they
might be used.

verbose: for each frame, some information about the time the frames
	sit in the stream, and whether a frame is skipped/cloned will
	be printed.  This is info is probably not useful except for
	debugging.

mode :	There are two modes of operation, buffered (1) and fast (0).
	Buffered mode is the default and recommended mode of operation.
	While fast mode will get you the correct frame rate, depending
	on the video stream you're working with it may look very poor

	For example, if you have a 29.97fps stream recorded from your
	tv card, I've frequently found that it's really 23.976fps or
	25.0fps, and there are just duplicate frames.  If you use fast
	mode to convert the steam, your odds are one in 5 if you're
	going from 29.97 to 23.976 of randomly hitting the pattern
	such that the skipped frames are the duplicate frames.  If you
	miss, you'll skip a frame with action, which gives you a big
	jump, and soon after that they'll be the duplicate frame.  This
	gives one a bad headache if you try to watch to much of it on
	a monitor (I'm not sure if it might be acceptable output to
	a tv.

	Unless you really need the speed, I'd suggest sticking with
	mode 1.

buffer: The number of frames to buffer.  The default is 5, which is
	perfect when going from 29.97fps to 23.976.  As a note, while
	it might initially seem like a good idea to run with 100 buffers
	if you've got the RAM, note that modfps does not consider local
	effects when deciding which frame to skip/clone.  Thus, if one
	were going from 29.97fps to 15fps and there were 60 frames
	buffered, it could decide that all of the frames that it wants to
	skip are the last ones, which would mean that AV could be off
	by 1.5 seconds.

	A good formulae to decide on the number of buffers to use is:
	buffers = absolute value of ( input_fps/(input_fps-output_fps) )

subsample: When calculating the difference of frames, how many pixels do
	we look at?  If we looked at all of them, buffered mode would
	run slower, while not behaving too differently.  The current
	default is to run with subsample=32 , or to say in English, to
	compare every 32nd pixel.  on a 640x480 frame this gives 9600
	points of comparison. ... on the 640x240 frames I record from
	tv, it's 4800 points of comparison, and I've yet to see modfps
	choose a bad frame to skip, but perhaps your source is different
	enough to warrant setting this lower.

clonetype: Ah, the fun stuff.  When cloning a frame, modfps will let
	one do more than a simple copy of the data.  Suppose that the
	first cloned frame is frame 5.  Frame 5 and 6 of the import
	steam will correspond to frame 5 and 7 of the export steam.
	Frame 6 of the export steam will varry based on clonetype.

	0: no special behavior, export frame 6 will be the same as
	export frame 5.

	1: export frame 6 will be composed by alternating the rows
	from export frame 5 and 7, with the top row being from frame
	5 .  If the display device is interlaced, like a TV, this
	might look best (it's similar to how a 23.976 fps dvd is
	displayed at 29.97 fps; some of the frames have alternating
	rows from the frame previous and after).

	2: export frame 6 will be composed by alternating the rows
	from export frame 5 and 7, with the top row being from frame
	7 .  This is similar to value 1, except the order of the
	frames is switched.  I'm not sure if this would be useful, but
	it was trivial to add.

	3: every pixel of frame 6 will be an arithmetic average of
	the corresponding pixel from frame 5 and 7.

	4: First we examine where the cloned frame sits in the export
	stream, and compare it to where it came from in the import
	steam.  some of the time, the following frame is closer to
	"correct" based on timestamps.  If so, frame 6 is a copy of
	frame 7.

	in other cases, the frame might be temporally weighted closer
	to frame 5, or between 7.  In such cases, we assign a weight
	to frame 5 and 7 such that the sum of the weights are 1.
	Then each pixel of frame 6 will be the sum of the weight of
	the frame times the pixel of the frame.  I.E. if the temporal
	weight is closest to frame 5, frame 6 will look mostly like
	frame 5, but will have some info from frame 7 in it.

	I think for non-interlaced displays this or value 5 will give
	the best results.

	5: for this, I considered how a TV looks, with the bright
	pixels "sticking around" better than the less bright areas.
	This is currently only implemented for YUV (and it will
	probably stay that way).

	An approximate ordering of the speed of the different clone
	types is: 0, 1, 2, 3, 4, 5 from fastest to slowest.   In my
	opinion, modes 0-4 are fast enough to not worry about speed,
	while mode 5 is slow enough that you should test and make
	sure you like the results before doing a lot with it.


3. Misc.

Because transcode by default will buffer some (5) frames, whenever
one isn't encoding an entire file (I.E. using "-c" "-J cut" or
"-J skip").

As of transcode 0.6.9 filters do not see frames which are not in
the range of "-c" ... thus if one is cutting frames 1000-1500 the
first frame modfps sees is 1000 .  As modfps buffers the frame it
sets the TC_FRAME_IS_SKIPPED flag, so it won't be encoded.  After
it's read in the buffers it wants, it will start outputting the
frames it's buffered.  I.E. the first frame modfps doesn't set as
skipped will be 1005 from the encoder's point of view while
corresponding to frame 1000 from the import stream.

When using -c ,  one should add the number of buffers to the last
segment, because all of the frames are offset by a bit.

I.E. if: "-c 3916-13090,17493-24185,28304-42212,46777-54710" gives
you the segments you want without modifying the frame rates, when
you use modfps=buffers=5 (the default) you should modify the option
to "-c 3916-13090,17493-24185,28304-42212,46777-54715"

Also of interest is that ivtc works as well with modfps as it
does with decimate.  While there's no real benefit to use it when
transcoding from 29.97 to 23.976 fps I've seen 29.97 video which has
telecine effects, except it's at 25.0 fps ... "-J ivtc,modfps=buffers=7"
works well in this case.

When using -J cut, cut will alter the number of audio frames skipped
so that "-J cut=1000-1500,modfps -f 29.97 --export_fps 23,1" will keep
the AV in sync while -c wouldn't.  However, again all the frames in
the export are off by the number of buffers.  But the effects are
different from -c , so if you wanted to use
"-J cut="3916-13090,17493-24185,28304-42212" you would then use
"-J cut="3921-13095,17498-24190,28309-42217"

4. Example commands

the source video of the example was recorded from my tv card, with
half the height of the source (to avoid interlacing effects, and
because I can't encode 640x480 realtime) .  Because of this, the
video I get is usually either 29.97fps (10%) 23.976 (65%) 25.0 (25%).
The percentages correspond to roughly how often I come upon this
source.  As you can see, I rarely have something that will look
good without modification, and even then, decimate wouldn't always
save me.

First, I do a first pass of xvid, video only.

/usr/bin/nice -n +19 /usr/local/test/bin/transcode --print_status \
5 --export_fps 25.0,3 -x mplayer -J modfps=buffer=7,hqdn3d -V -i tv.avi \
-b 64 -c 3916-13090,17493-24185,28304-42212,46777-54715 -y xvid,null \
-j 0,8 -Y 0,-4 -B 0,39,08 -w 600 -R 1

Next, I do the 2nd pass, again, video only.

/usr/bin/nice -n +19 /usr/local/test/bin/transcode --print_status 5 \
--export_fps 25.0,3 -x mplayer -J modfps=buffer=7,hqdn3d -V -i tv.avi \
-b 64 -c 3916-13090,17493-24185,28304-42212,46777-54715 -y xvid,null \
-j 0,8 -Y 0,-4 -B 0,39,08 -w 600 -R 2 -o tmp.tv.avi

Now that I have the video, I examine audio.  I've found that since
commercials are much louder than TV, before I give anything to filter
astat, I'll rip the raw audio.  Note, that the values of -c are
different for audio than they were for video.

I wish to stress: Currently, one cannot use "-c" with transcode and
process audio and video at the same time while changing the frame
rate without losing AV sync.

/usr/bin/nice -n +19 /usr/local/test/bin/transcode --print_status \
500 -f 29.970 --export_fps 25.000 -u 35 -x null,auto -y null,wav -m \
sound.tv.avi -c 3267-10919,14593-20175,23611-35212,39020-45637 -i tv.avi

Then we do a quick pass thru with filter astat.

/usr/bin/nice -n +19 /usr/local/test/bin/transcode --print_status 500 \
-f 23.976,1 -p sound.tv.avi -J astat=alog -u 50

Finally, I encode the audio while using passthru mode to merge it with
video.

/usr/bin/nice -n +19 /usr/local/test/bin/transcode --print_status 50 \
-i tmp.tv.avi -o out.tv.avi -p sound.tv.avi -s `cat alog` -P1 -b 64 \
-y raw

Please note in the above example that modfps was only used to handle
changing the rate of the video, and that one must give different
values to -c (as of transcode 0.6.9)

5. Possibly useful scripts

The following scripts are what I use to calculate my cut sections
which are given to transcode.  I use avidemux v2 on input avi's
to cut out commercials from a video stream I had recorded on tv.
I could use avidemux to save the avi, however some of the sections
well be reencodded, and it's a semi-slow process (taking a few
minutes, meaning that it's highly interruptive) so I prefer to just
have transcode take a bit longer to skip over the commercials.
instead I save the edit list, and have my encoding scripts cat
the edit list thru two commands.  The first gives the "-c" values
for video, and the 2nd handles it for audio.

----- BEGIN PERL SCRIPT -----
#!/usr/bin/perl

$numbuffers = 5;
$i = 0;
$argc = @ARGV;

# we take one argument: the number of buffers
if ($argv > 0){
  $numbuffers = $ARGV[0];
}

while(<STDIN>){
  if(/^Start : (\d+)/){
    # even tho initially some frames will be skipped, with newest
    # transcode, we keep the beginning where it should be, but we must
    # add 5 frames to the very last segment.
    $c[$i]{start} = 1 + $1;
    $_ = <STDIN>;
    if(/^Size : (\d+)/){
      $c[$i]{end} = $c[$i]{start}+$1;
      $i++;
    } else {
      print;
      die "ergh, we were supposed to find the end\n";
    }
  }
}

if($i > 0){
  $c[$i-1]{end} += $numbuffers ;
} else {
  print "-c 0";
  exit -1;
}

print "-c ";
for($j=0;$j<$i;$j++){
  print "$c[$j]{start}-$c[$j]{end}";
  if($i-1 != $j){
    print ",";
  }
}
print "\n";
----- END PERL SCRIPT -----

----- BEGIN PERL SCRIPT -----
#!/usr/bin/perl

# read from stdin the workbench

$infps = 5;
$outfps = 4;
$argc=@ARGV;
if ($argc>0){
  $infps=$ARGV[0];
}
if ($argc>1){
  $outfps = $ARGV[1];
}

use POSIX;
$i = 0;
while(<STDIN>){
  if(/^Start : (\d+)/){
    $c[$i]{start} = ceil((1+$1)*($outfps/$infps));
    $_ = <STDIN>;
    if(/^Size : (\d+)/){
      $c[$i]{end} = floor($c[$i]{start}+($1*$outfps/$infps));
      $i++;
    } else {
      print;
      die "ergh, we were supposed to find the end\n";
    }
  }
}
print "-c ";
for($j=0;$j<$i;$j++){
  print "$c[$j]{start}-$c[$j]{end}";
  if($i-1 != $j){
    print ",";
  }
}
print "\n";
----- END PERL SCRIPT -----

The first script takes one argument, the number of buffers
passed to modfps, and will default to 5, like modfps.  The 2nd
script takes two arguments, the import_fps and the export_fps.
Note that the intent of my choice of audio sections was not to
get audio outside of the cutzone (possibly large noise from the
very last bit of a commercial), and thus the video and audio
might end up encoding a different number of frames, and thus
with many cut sections, AV sync might be lost.  However, these
work for me ;)