FFmpeg Wishlist

From MultimediaWiki
Jump to navigation Jump to search

A temporary FFmpeg wish/todo list:

Decoders

Demuxers

libstream (a common 'stream' library)

  • Create a common 'stream demuxer/parser library' (and/or API for adding support for additional streaming formats?) - a LGPL'ed sub-library in FFmpeg with all stream demuxers/parsers gathered (similar to the libpostproc and libavutil). Call it "libstream" (or "stream" or whatever). Move FFmpeg's existing stream code there like HTTP and RTSP/RTP. This will help reduce future code replication by sharing common code, thus making it easier to add support for additional streaming formats. All togther making it super easy for audio/video players using FFmpeg to add all-in-one streaming support to their player.
    • Maybe use either MPlayer's "stream" library, LIVE555, or probebly the better libnms (from NeMeSi) as a base for such a common library?
    • HTTP (Hypertext Transfer Protocol)
    • UDP (User Datagram Protocol)
    • RTSP - Real-Time Streaming Protocol (RFC2326)
    • RTP/RTCP - Real-Time Transport Protocol/RTP Control Protocol (RFC3550)
    • RTP Profile for Audio and Video Conferences with Minimal Control (RFC3551)
    • RealMedia RTSP/RDT (Real Time Streaming Protocol / Real Data Transport) proprietary transport protocol developed by RealNetworks to stream RealVideo/RealAudio
    • SDP (Service Discovery Protocol) / SSDP (Simple Service Discovery Protocol)
    • MMS (Microsoft Media Services)

Features

Subtitles

  • Create a common 'subtitles parser library' (and/or API for adding support for additional subtitle formats?) - a common sub-library to FFmpeg with all subtile decoders/demuxers/parsers gathered (similar to the libpostproc and libavutils). Call it "libsubs" (or "libsub", "libsubtitles" or whatever). Move FFmpeg's existing VobSub and DVBsub code there, so no matter if they are bitmap or text-based subs all existing and future subtile code is collected there. This will help reduce future code replication by sharing common code, thus making it easier to add support for additional subtitles.
    • Maybe use MPlayer's recently added "libass" (SSA/ASS subtile reader) as a base for such a common library?
  • Support for advanced SSA/ASS rendering using libass or asa or any other mean,
  • Support bold, italic, underline, RGB colors, size changes and font changes for a whole line or part of one line, (not all of these are supported now)
  • Line 23 signal (a.k.a. "Wide-screen signal") detecting and use for DVD-Video (VobSub)
  • Support for the subtitles HTML tags
  • Capability of displaying subtitles with no video enabled (for example for audio-books)
  • Support for Karaoke subtitles (for kar and cdg, etc.)
  • Capability of moving the subtitles in the picture (freetype renderer)
  • Support more subtitle formats (text and bitmap-based):
    • Closed captioning (CC) subtile support - (Closed captions for the deaf and hard of hearing, also known as "Line 21 captioning", uses VobSub bitmaps)
    • DirectVobSub (VSFilter) - standard VobSubs (DVD-Video subtitles) embedded in AVI containers
    • DivX Subtitles (XSUB) display/reader/decoder (Note: bitmap based subtitle, similar to VobSub)
    • SubRip (.srt) subtile support (Note: simple text-based based subtitle with timestamp)
    • Subviewer (.sub) subtile support (Note: simple text-based based subtitle with timestamp)
    • MicroDVD (.sub) subtile support (Note: simple text-based based subtitle with timestamp
    • Sami (.smi) subtile support (Note: simple text-based based subtitle with timestamp)
    • SubStation Alpha (.ssa+.ass) subtile support (Note: advanced text-based based subtitle with timestamps and XY location on screen)
    • RealText (.rt) subtile support
    • PowerDivx (.psb) subtile support
    • Universal Subtitle Format (.usf) subtile support
    • Structured Subtitle Format (.ssf) subtile support

Misc

Snow

  • multiple reference frames improvements
    • decide which frames to keep (e.g. long-term refs)
    • some changes to the mv prediction code
  • non translational motion compensation
    • estimate non translational parameters per block by using surrounding motion vectors
    • add a ac coded bit per block to switch between translational and non-translational MC
    • borrow the non translational MC code from libmpcodecs/vf_perspective.c
    • some changes to the encoder to decide between translational and non t.
  • Trellis quantization (select quantized coefficient so as to minimize the rate distrortion
  • 4x4 sized block support (we have 16x16 and 8x8 currently)
  • 1/8 pel motion compensation / estimation support (pretty much just encoder changes needed which in case of the iterative me should be trivial)
  • improve the intra color decision


A/V Filters

  • Adopt MPlayer's filter system or create new one.
  • Decide on name of a such filter API.
    • libavfilter (conflicts with LAVF)? libavmunge?
  • Create (or port) additional pre-process and post-process video filters:
    • General post-proc sources are MPlayer (libmpcodecs vf_*.c filters) and FFdshow
    • SSP (Statistical-Post-Processing)
    • DeBlocking
    • DeRinging
    • Sharpen / UnSharpen (Soften)
    • ReQuantization
    • Auto-Luminance
    • Blurring / DeNoising
    • Deinterlace (weave AND bob) filters
    • 2:3 pull-down / ivtc (inverse telecine) for 24 progressive-frames on 30 FPS TV's
    • NTSC => PAL, and PAL => NTSC frame-rate (FPS) adjust and reclock filter for NTSC <=> PAL conversion
      • NTSC <=> PAL frame-rate adjust FPS ratios?: 23.97 <=> 25, 24 <=> 25, 30 <=> 25, 25 <=> 30
  • Create (or port) additional pre-process and post-process audio filters: