Mercurial > sdl-ios-xcode
view src/timer/macos/FastTimes.c @ 942:41a59de7f2ed
Here are patches for SDL12 and SDL_mixer for 4 or 6 channel
surround sound on Linux using the Alsa driver. To use them, naturally
you need a sound card that will do 4 or 6 channels and probably also a
recent version of the Alsa drivers and library. Since the only SDL
output driver that knows about surround sound is the Alsa driver,
you���ll want to choose it, using:
export SDL_AUDIODRIVER=alsa
There are no syntactic changes to the programming API. No new
library calls, no differences in arguments.
There are two semantic changes:
(1) For library calls with number of channels as an argument, formerly
you could use only 1 or 2 for the number of channels. Now you
can also use 4 or 6.
(2) The two "left" and "right" arguments to Mix_SetPanning, for the
case of 4 or 6 channels, no longer simply control the volumes of
the left and right channels. Now the "left" argument is converted
to an angle and Mix_SetPosition is called, and the "right" argu-
ment is ignored.
With two exceptions, so far as I know, the modified SDL12 and
SDL_mixer work the same way as the original versions, when opened for
1 or 2 channel output. The two exceptions are bugs which I fixed.
Well, the first, anyway, is a bug for sure. When rate conversions up
or down by a factor of two are applied (in src/audio/SDL_audiocvt.c),
streams with different numbers of channels (that is, mono and stereo)
are treated the same way: either each sample is copied or every other
sample is omitted. This is ok for mono, but for stereo, it is frames
that should be copied or omitted, where by "frame" I mean a portion of
the stream containing one sample for each channel. (In the SDL source,
confusingly, sometimes frames are called "samples".) So for these
rate conversions, stereo streams have to be treated differently, and
they are, in my modified version.
The other problem that might be characterized as a bug arises
when SDL_mixer is passed a multichannel chunk which does not have an
integral number of frames. Due to the way the effect_position code
loops over frames, when the chunk ends with a partial frame, memory
outside the chunk buffer will be accessed. In the case of stereo,
it���s possible that because malloc may give more memory than requested,
this potential problem never actually causes a segment fault. I don���t
know. For 6 channel chunks, I do know, and it does cause segment
faults.
If SDL_mixer is passed defective chunks and this causes a segment
fault, arguably, that���s not a bug in SDL_mixer. Still, whether or not
it counts as a bug, it���s easy to protect against, so why not? I added
code in mixer.c to discard any partial frame at the end of a chunk.
Then what about when SDL or SDL_mixer is opened for 4 or 6 chan-
nel output? What happens with the parts of the current library
designed for stereo? I don���t know whether I���ve covered all the bases,
but I���ve tried:
(1) For playing 2 channel waves, or other cases where SDL knows it has
to match up a 2 channel source with a 4 or 6 channel output, I���ve
added code in SDL_audiocvt.c to make the necessary conversions.
(2) For playing midis using timidity, I���ve converted timidity to do 4
or 6 channel output, upon request.
(3) For playing mods using mikmod, I put ad hoc code in music.c to
convert the stereo output that mikmod produces to 4 or 6 chan-
nels. Obviously it would be better to change the mikmod code to
mix down into 4 or 6 channels, but I have a hard time following
the code in mikmod, so I didn���t do that.
(4) For playing mp3s, I put ad hoc code in smpeg to copy channels in
the case when 4 or 6 channel output is needed.
(5) There seems to be no problem with .ogg files - stereo .oggs can be
up converted as .wavs are.
(6) The effect_position code in SDL_mixer is now generalized to in-
clude the cases of 4 and 6 channel streams.
I���ve done a very limited amount of compatibility testing for some
of the games using SDL I happen to have. For details, see the file
TESTS.
I���ve put into a separate archive, Surround-SDL-testfiles.tgz, a
couple of 6 channel wave files for testing and a 6 channel ogg file.
If you have the right hardware and version of Alsa, you should be able
to play the wave files with the Alsa utility aplay (and hear all
channels, except maybe lfe, for chan-id.wav, since it���s rather faint).
Don���t expect aplay to give good sound, though. There���s something
wrong with the current version of aplay.
The canyon.ogg file is to test loading of 6 channel oggs. After
patching and compiling, you can play it with playmus. (My version of
ogg123 will not play it, and I had to patch mplayer to get it to play
6 channel oggs.)
Greg Lee <greg@ling.lll.hawaii.edu>
Thus, July 1, 2004
author | Sam Lantinga <slouken@libsdl.org> |
---|---|
date | Sat, 21 Aug 2004 12:27:02 +0000 |
parents | 74212992fb08 |
children | f12379c41042 |
line wrap: on
line source
/* File "FastTimes.c" - Original code by Matt Slot <fprefect@ambrosiasw.com> */ /* Created 4/24/99 - This file is hereby placed in the public domain */ /* Updated 5/21/99 - Calibrate to VIA, add TBR support, renamed functions */ /* Updated 10/4/99 - Use AbsoluteToNanoseconds() in case Absolute = double */ /* Updated 2/15/00 - Check for native Time Manager, no need to calibrate */ /* Updated 2/19/00 - Fixed default value for gScale under native Time Mgr */ /* Updated 3/21/00 - Fixed ns conversion, create 2 different scale factors */ /* Updated 5/03/00 - Added copyright and placed into PD. No code changes */ /* Updated 8/01/00 - Made "Carbon-compatible" by replacing LMGetTicks() */ /* This file is Copyright (C) Matt Slot, 1999-2000. It is hereby placed into the public domain. The author makes no warranty as to fitness or stability */ #include <Gestalt.h> #include <LowMem.h> #include <CodeFragments.h> #include <DriverServices.h> #include <Timer.h> #include "FastTimes.h" /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ /* On 680x0 machines, we just use Microseconds(). On PowerPC machines, we try several methods: * DriverServicesLib is available on all PCI PowerMacs, and perhaps some NuBus PowerMacs. If it is, we use UpTime() : Overhead = 2.1 µsec. * The PowerPC 601 has a built-in "real time clock" RTC, and we fall back to that, accessing it directly from asm. Overhead = 1.3 µsec. * Later PowerPCs have an accurate "time base register" TBR, and we fall back to that, access it from PowerPC asm. Overhead = 1.3 µsec. * We can also try Microseconds() which is emulated : Overhead = 36 µsec. On PowerPC machines, we avoid the following: * OpenTransport is available on all PCI and some NuBus PowerMacs, but it uses UpTime() if available and falls back to Microseconds() otherwise. * InputSprocket is available on many PowerMacs, but again it uses UpTime() if available and falls back to Microseconds() otherwise. Another PowerPC note: certain configurations, especially 3rd party upgrade cards, may return inaccurate timings for the CPU or memory bus -- causing skew in various system routines (up to 20% drift!). The VIA chip is very accurate, and it's the basis for the Time Manager and Microseconds(). Unfortunately, it's also very slow because the MacOS has to (a) switch to 68K and (b) poll for a VIA event. We compensate for the drift by calibrating a floating point scale factor between our fast method and the accurate timer at startup, then convert each sample quickly on the fly. I'd rather not have the initialization overhead -- but it's simply necessary for accurate timing. You can drop it down to 30 ticks if you prefer, but that's as low as I'd recommend. Under MacOS 9, "new world" Macs (iMacs, B+W G3s and G+W G4s) have a native Time Manager implementation: UpTime(), Microseconds(), and TickCount() are all based on the same underlying counter. This makes it silly to calibrate UpTime() against TickCount(). We now check for this feature using Gestalt(), and skip the whole calibration step if possible. */ /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ #define RTCToNano(w) ((double) (w).hi * 1000000000.0 + (double) (w).lo) #define WideTo64bit(w) (*(UInt64 *) &(w)) /* LMGetTicks() is not in Carbon and TickCount() has a fair bit of overhead, so for speed we always read lowmem directly. This is a MacOS X no-no, but it always work on those systems that don't have a native Time Manager (ie, anything before MacOS 9) -- regardless whether we are in Carbon or not! */ #define MyLMGetTicks() (*(volatile UInt32 *) 0x16A) #if GENERATINGPOWERPC static asm UnsignedWide PollRTC(void); static asm UnsignedWide PollTBR(void); static Ptr FindFunctionInSharedLib(StringPtr libName, StringPtr funcName); static Boolean gInited = false; static Boolean gNative = false; static Boolean gUseRTC = false; static Boolean gUseTBR = false; static double gScaleUSec = 1.0 / 1000.0; /* 1 / ( nsec / usec) */ static double gScaleMSec = 1.0 / 1000000.0; /* 1 / ( nsec / msec) */ /* Functions loaded from DriverServicesLib */ typedef AbsoluteTime (*UpTimeProcPtr)(void); typedef Nanoseconds (*A2NSProcPtr)(AbsoluteTime); static UpTimeProcPtr gUpTime = NULL; static A2NSProcPtr gA2NS = NULL; #endif /* GENERATINGPOWERPC */ /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ void FastInitialize() { SInt32 result; if (!gInited) { #if GENERATINGPOWERPC /* Initialize the feature flags */ gNative = gUseRTC = gUseTBR = false; /* We use CFM to find and load needed symbols from shared libraries, so the application doesn't have to weak-link them, for convenience. */ gUpTime = (UpTimeProcPtr) FindFunctionInSharedLib( "\pDriverServicesLib", "\pUpTime"); if (gUpTime) gA2NS = (A2NSProcPtr) FindFunctionInSharedLib( "\pDriverServicesLib", "\pAbsoluteToNanoseconds"); if (!gA2NS) gUpTime = nil; /* Pedantic but necessary */ if (gUpTime) { /* If we loaded UpTime(), then we need to know if the system has a native implementation of the Time Manager. If so, then it's pointless to calculate a scale factor against the missing VIA */ /* gestaltNativeTimeMgr = 4 in some future version of the headers */ if (!Gestalt(gestaltTimeMgrVersion, &result) && (result > gestaltExtendedTimeMgr)) gNative = true; } else { /* If no DriverServicesLib, use Gestalt() to get the processor type. Only NuBus PowerMacs with old System Software won't have DSL, so we know it should either be a 601 or 603. */ /* Use the processor gestalt to determine which register to use */ if (!Gestalt(gestaltNativeCPUtype, &result)) { if (result == gestaltCPU601) gUseRTC = true; else if (result > gestaltCPU601) gUseTBR = true; } } /* Now calculate a scale factor to keep us accurate. */ if ((gUpTime && !gNative) || gUseRTC || gUseTBR) { UInt64 tick, usec1, usec2; UnsignedWide wide; /* Wait for the beginning of the very next tick */ for(tick = MyLMGetTicks() + 1; tick > MyLMGetTicks(); ); /* Poll the selected timer and prepare it (since we have time) */ wide = (gUpTime) ? (*gA2NS)((*gUpTime)()) : ((gUseRTC) ? PollRTC() : PollTBR()); usec1 = (gUseRTC) ? RTCToNano(wide) : WideTo64bit(wide); /* Wait for the exact 60th tick to roll over */ while(tick + 60 > MyLMGetTicks()); /* Poll the selected timer again and prepare it */ wide = (gUpTime) ? (*gA2NS)((*gUpTime)()) : ((gUseRTC) ? PollRTC() : PollTBR()); usec2 = (gUseRTC) ? RTCToNano(wide) : WideTo64bit(wide); /* Calculate a scale value that will give microseconds per second. Remember, there are actually 60.15 ticks in a second, not 60. */ gScaleUSec = (60.0 * 1000000.0) / ((usec2 - usec1) * 60.15); gScaleMSec = gScaleUSec / 1000.0; } #endif /* GENERATINGPOWERPC */ /* We've initialized our globals */ gInited = true; } } /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ UInt64 FastMicroseconds() { UnsignedWide wide; UInt64 usec; #if GENERATINGPOWERPC /* Initialize globals the first time we are called */ if (!gInited) FastInitialize(); if (gNative) { /* Use DriverServices if it's available -- it's fast and compatible */ wide = (*gA2NS)((*gUpTime)()); usec = (double) WideTo64bit(wide) * gScaleUSec + 0.5; } else if (gUpTime) { /* Use DriverServices if it's available -- it's fast and compatible */ wide = (*gA2NS)((*gUpTime)()); usec = (double) WideTo64bit(wide) * gScaleUSec + 0.5; } else if (gUseTBR) { /* On a recent PowerPC, we poll the TBR directly */ wide = PollTBR(); usec = (double) WideTo64bit(wide) * gScaleUSec + 0.5; } else if (gUseRTC) { /* On a 601, we can poll the RTC instead */ wide = PollRTC(); usec = (double) RTCToNano(wide) * gScaleUSec + 0.5; } else #endif /* GENERATINGPOWERPC */ { /* If all else fails, suffer the mixed mode overhead */ Microseconds(&wide); usec = WideTo64bit(wide); } return(usec); } /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ UInt64 FastMilliseconds() { UnsignedWide wide; UInt64 msec; #if GENERATINGPOWERPC /* Initialize globals the first time we are called */ if (!gInited) FastInitialize(); if (gNative) { /* Use DriverServices if it's available -- it's fast and compatible */ wide = (*gA2NS)((*gUpTime)()); msec = (double) WideTo64bit(wide) * gScaleMSec + 0.5; } else if (gUpTime) { /* Use DriverServices if it's available -- it's fast and compatible */ wide = (*gA2NS)((*gUpTime)()); msec = (double) WideTo64bit(wide) * gScaleMSec + 0.5; } else if (gUseTBR) { /* On a recent PowerPC, we poll the TBR directly */ wide = PollTBR(); msec = (double) WideTo64bit(wide) * gScaleMSec + 0.5; } else if (gUseRTC) { /* On a 601, we can poll the RTC instead */ wide = PollRTC(); msec = (double) RTCToNano(wide) * gScaleMSec + 0.5; } else #endif /* GENERATINGPOWERPC */ { /* If all else fails, suffer the mixed mode overhead */ Microseconds(&wide); msec = ((double) WideTo64bit(wide) + 500.0) / 1000.0; } return(msec); } /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ StringPtr FastMethod() { StringPtr method = "\p<Unknown>"; #if GENERATINGPOWERPC /* Initialize globals the first time we are called */ if (!gInited) FastInitialize(); if (gNative) { /* The Time Manager and UpTime() are entirely native on this machine */ method = "\pNative UpTime()"; } else if (gUpTime) { /* Use DriverServices if it's available -- it's fast and compatible */ method = "\pUpTime()"; } else if (gUseTBR) { /* On a recent PowerPC, we poll the TBR directly */ method = "\pPowerPC TBR"; } else if (gUseRTC) { /* On a 601, we can poll the RTC instead */ method = "\pPowerPC RTC"; } else #endif /* GENERATINGPOWERPC */ { /* If all else fails, suffer the mixed mode overhead */ method = "\pMicroseconds()"; } return(method); } /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ #pragma mark - #if GENERATINGPOWERPC asm static UnsignedWide PollRTC_() { entry PollRTC /* Avoid CodeWarrior glue */ machine 601 @AGAIN: mfrtcu r4 /* RTCU = SPR 4 */ mfrtcl r5 /* RTCL = SPR 5 */ mfrtcu r6 cmpw r4,r6 bne @AGAIN stw r4,0(r3) stw r5,4(r3) blr } /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ asm static UnsignedWide PollTBR_() { entry PollTBR /* Avoid CodeWarrior glue */ machine 604 @AGAIN: mftbu r4 /* TBRU = SPR 268 */ mftb r5 /* TBRL = SPR 269 */ mftbu r6 cmpw r4,r6 bne @AGAIN stw r4,0(r3) stw r5,4(r3) blr } /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ /* **** **** **** **** **** **** **** **** **** **** **** **** **** **** **** */ static Ptr FindFunctionInSharedLib(StringPtr libName, StringPtr funcName) { OSErr error = noErr; Str255 errorStr; Ptr func = NULL; Ptr entry = NULL; CFragSymbolClass symClass; CFragConnectionID connID; /* Find CFM containers for the current archecture -- CFM-PPC or CFM-68K */ if (/* error = */ GetSharedLibrary(libName, kCompiledCFragArch, kLoadCFrag, &connID, &entry, errorStr)) return(NULL); if (/* error = */ FindSymbol(connID, funcName, &func, &symClass)) return(NULL); return(func); } #endif /* GENERATINGPOWERPC */