FFmpeg Wishlist

From MultimediaWiki
Jump to navigation Jump to search

Temporary FFmpeg wish/todo list:

Decoders

  • H.264 (MPEG-4 AVC) decoder improvements/enhancements:
    • Add PAFF to the existing H.264 decoder
    • Assembly optimizations
  • ALAC decoder improvements/enhancements:
    • Clean up the existing alac decoder code
  • ffsvq3 decoder improvements/enhancements:
    • add b-frame support to the ffsvq3 decoder
  • amr decoder
  • dv decoder
  • integrate speex (glue code or native)
  • g723.1/rtp decoder
  • g729/rtp decoder
  • Bethsoft VID decoder
  • Monkey's Audio decoder, look at the C++ SDK sources
  • JPEG2000 decoder
  • gsm decoder
  • QCELP decoder spec is c.s0020 and source is c.r0020
  • AMV decoder, http://scrub50187.com/ has the creator. wikipedia has articles about the format also.
  • integer only vorbis decoder (to replace tremor)
  • LGPL'ed LE-AAC and HE-AAC decoder
    • Also add a aac parser so -acodec copy to mp4/mov will work
  • aacPlus (a.k.a. AAC+) decoder (Note: aacPlus v1 is HE-AAC + SBR, aacPlus v2 is HE-AAC + SBR + PS. Standard HE-AAC decoders can decode aacPlus encoded files/streams but without SBR and PS you do not get the full quality. By adding a SBR decoder and a PS decoder to FFmpeg and coupling it with an existing HE-AAC decoder you could playback aacPlus at full quality? A SBR decoder could be shared with a mp3PRO decoder as mp3PRO also uses SBR).
  • mp3PRO decoder (Note: mp3PRO is MP3 + SBR. Standard MP3 decoders can decode mp3PRO encoded files/streams but without SBR you do not get the full quality. By adding a SBR decoder to FFmpeg and coupling it with the existing MP3 decoder you could playback mp3PRO at full quality? A SBR decoder could be shared with a aacPlus decoder as aacPlus also uses SBR).
  • MPEG Surround decoder/parser (for all audio but especially MP3/mp3PRO and AAC/aacPlus as those are in use today). MPEG Surround technology share similar characteristics with SBR (Spectral Band Replication) and PS (Parametric Stereo), which mp3PRO and aacPlus also use, so if SBR and PS decoders was added to FFmpeg then those could probebly share common code with a MPEG Surround decoder/parser.
  • Add j-type picture support to wmv8 decoder
  • MLP decoder
  • Indeo 4 decoder and Indeo 5 decoder
  • VNC decoder, files created by vncrec. Re-use code from VMware Video decoder http://www.sodan.org/~penny/vncrec/
  • Additional game formats support:

Encoders

  • Snow
    • multiple reference frames improvements
      • decide which frames to keep (e.g. long-term refs)
      • some changes to the mv prediction code
    • non-translational motion-compensation
      • estimate non translational parameters per block by using surrounding motion vectors
      • add a ac coded bit per block to switch between translational and non-translational MC
      • borrow the non translational MC code from libmpcodecs/vf_perspective.c
      • some changes to the encoder to decide between translational and non t.
    • Trellis quantization (select quantized coefficient so as to minimize the rate distrortion
    • 4x4 sized block support (we have 16x16 and 8x8 currently)
    • 1/8 pel motion compensation / estimation support (pretty much just encoder changes needed which in case of the iterative me should be trivial)
    • improve the intra color decision
  • dv encoder
  • integrate speex, glue code or native


Demuxers


Muxers

  • DVB (MPEG-TS) muxer inside DVB containers
    • MPEG-1/2 video-streams inside DVB containers
    • MPEG-4 ASP video-streams inside DVB containers
    • MPEG-4 AVC (H.264) video-streams inside DVB containers
    • AC3 audio-streams inside DVB containers
      • Mutiple AC3 audio-streams inside DVB containers
    • MP3 audio-streams inside DVB containers
      • Mutiple MP3 audio-streams inside DVB containers
  • NSV muxer
  • NSA muxer

DirectShow and DirectX and MediaFoundation

Native DirectShow support

Decoder/encoder/demuxer/muxer and post-processing filter API for Windows by Microsoft

Native MediaFoundation support

  • Microsoft Media Foundation API usage for optimized digital media playback on Microsoft Windows Vista™
  • Multimedia Class Scheduler Service (MMCSS) class
  • Enhanced Video Renderer (EVR) class
  • Streaming Audio Renderer (SAR) class

DirectX Video Acceleration (DXVA) 1.0 AND 2.0 for video decoding under Windows

Note! For this native DirectShow support in FFmpeg is first needed

http://download.microsoft.com/download/5/b/9/5b97017b-e28a-4bae-ba48-174cf47d23cd/MED134_WH06.ppt

Features


Subtitles

  • Create a common 'subtitles parser library' (and/or an API system for adding support for additional subtitle formats?) - a common sub-library to FFmpeg with all subtile decoders/demuxers/parsers gathered (similar to the libpostproc and libavutils). Call it "libsubs" (or "libsub", "libsubtitles" or whatever). Move FFmpeg's existing VobSub and DVBsub code there, so no matter if they are bitmap or text-based subs all existing and future subtile code is collected there. This will help reduce future code replication by sharing common code, thus making it easier to add support for additional subtitles.
    • Maybe use MPlayer's recently added "libass" (SSA/ASS subtile reader) as a base for such a common library?
  • Support for advanced SSA/ASS rendering
    • Possible source are libass or the asa library
  • Support bold, italic, underline, RGB colors, size changes and font changes for a whole line or part of one line
  • Line 23 signal (a.k.a. "Wide-screen signal") detecting and use for DVD-Video (VobSub)
  • Support for the subtitles HTML tags
  • Capability of displaying subtitles with no video enabled (for example for audio-books)
  • Support for Karaoke subtitles (for kar and cdg, etc.)
  • Dual-subtitle-display (display two subtitles/languages at the same time, one at the bottom as normal plus one at the top of the screen)
  • Capability of moving the subtitles in the picture (freetype renderer)
  • Support more subtitle formats (text and bitmap-based):
    • Closed captioning (CC) subtile support - (Closed captions for the deaf and hard of hearing, also known as "Line 21 captioning", uses VobSub bitmaps)
      • xine have a SPU decoder for subpictures and Closed Captions software decoding
    • DirectVobSub (VSFilter) - standard VobSubs (DVD-Video subtitles) embedded in AVI containers
    • DivX Subtitles (XSUB) display/reader/decoder (Note: bitmap based subtitle, similar to VobSub)
    • SubRip (.srt) subtile support (Note: simple text-based based subtitle with timestamp)
    • Subviewer (.sub) subtile support (Note: simple text-based based subtitle with timestamp)
    • MicroDVD (.sub) subtile support (Note: simple text-based based subtitle with timestamp
    • Sami (.smi) subtile support (Note: simple text-based based subtitle with timestamp)
    • SubStation Alpha (.ssa+.ass) subtile support (Note: advanced text-based based subtitle with timestamps and XY location on screen)
    • RealText (.rt) subtile support
    • PowerDivx (.psb) subtile support
    • Universal Subtitle Format (.usf) subtile support
    • Structured Subtitle Format (.ssf) subtile support


Misc

  • Add xing and/or vbri header parsing support to the mp3 decoder/parser
  • Add GAIN (MP3Gain) header parsing support to the MP3 decoder/parser
    • Also add GAIN (AACGain) header parsing support to the AAC decoder/parser
  • Add a aac parser so -acodec copy to mp4/mov works
  • Clean up the h263 rtp patch found on this page: http://www.salyens.com/downloads/index.html#ffmpeg-0.4.7
  • Add nice error messages to the flv demuxer(Nelly Moser)


Streaming Media Network Protocols

  • Create a common 'stream demuxer/parser library' for the client-side (and/or API for adding support for additional streaming formats?) - a LGPL'ed sub-library in FFmpeg with all stream demuxers/parsers gathered (similar to the libpostproc and libavutil). Call it "libstream" (or "stream" or whatever). Move FFmpeg's existing stream code there like HTTP and RTSP/RTP. This will help reduce future code replication by sharing common code, thus making it easier to add support for additional streaming formats. All togther making it super easy for audio/video players using FFmpeg to add all-in-one streaming support to their player.
    • Maybe use either MPlayer's "stream" library structure, LIVE555, or probebly the better libnms (from NeMeSi) as a base for such a common library?
  • Add support for additional streaming protocols (on the client side) and improve/enhance support for existing protocols:
    • HTTP (Hypertext Transfer Protocol) client
    • UDP (User Datagram Protocol) client
    • RTSP - Real-Time Streaming Protocol (RFC2326) client
    • RTP/RTCP - Real-Time Transport Protocol/RTP Control Protocol (RFC3550) client
    • RTP Profile for Audio and Video Conferences with Minimal Control (RFC3551) client
    • RealMedia RTSP/RDT (Real Time Streaming Protocol / Real Data Transport) client
    • SDP (Service Discovery Protocol) / SSDP (Simple Service Discovery Protocol) client
    • MMS (Microsoft Media Services) client


Audio and video (pre-process/post-process) filters

  • Adopt MPlayer's A/V filter system or create new one.
  • Decide on name of a such A/V filter API.
    • libavfilter (conflicts with LAVF)? libavmunge?
  • Create (or port) additional pre-process and post-process video filters:
    • General post-proc sources are MPlayer (libmpcodecs vf_*.c filters) and FFdshow
    • SSP (Statistical-Post-Processing)
    • DeBlocking
    • DeRinging
    • Sharpen / UnSharpen (Soften)
    • ReQuantization
    • Auto-Luminance
    • Blurring / DeNoising
    • Deinterlace (weave AND bob) filters
    • 2:3 pull-down / ivtc (inverse telecine) for 24 progressive-frames on 30 FPS TV's
    • NTSC => PAL, and PAL => NTSC frame-rate (FPS) adjust and reclock filter for NTSC <=> PAL conversion
      • NTSC <=> PAL frame-rate adjust FPS ratios?: 23.97 <=> 25, 24 <=> 25, 30 <=> 25, 25 <=> 30
  • Create (or port) additional pre-process and post-process audio filters: