Re: [Gems-users] Running SPLASH2 benchmarks with LogTM

Mailing List Archives Authenticated access	UW Madison Computer Sciences Department Computer Systems Lab

Date:	Tue, 06 Mar 2007 14:17:24 -0600
From:	Kevin Moore <kmoore@xxxxxxxxxxx>
Subject:	Re: [Gems-users] Running SPLASH2 benchmarks with LogTM

Shougata,

It sounds like tourmaline, or at least a more robustversion of it, would be perfect for you. If that works foryou, it'll be the fastest way for you to get what you want.But, I suspect that system interactions will prevent youfrom running the transactions in some of the benchmarks.You should definitely use GEMS 1.4. If for no other reasonthan it will run much faster than GEMS 1.3. To get rubywith LogTM to run as fast as possible configure the memorysystem to use a very large (and associative) L1 cache. GEMS1.4 supports fast L1 hits (1-cycle) for LogTM, but GEMS 1.3doesn't. I'd recommend using large L1 caches and seeing ifthe ruby slowdown is tolerable.


--Kevin

Dan Gibson wrote:

Shougata,

Shougata Ghosh wrote:
Hi
I am trying to run SPLASH-2 benchmarks with logTM. I have replaced thelocks in the code with magic instructions (xaction begin and commit).The cache coherence protocol I'm using is MESI_SMP_LogTM_directory. Myversion of simics is 2.2.19 and GEMS is 1.3 (not hooking up opal).
I want to collect the memory traces (along with xaction begins, commitsand aborts) and analyse them offline. I don't really care about thetiming info generated by ruby. Running with ruby slows it down too much!And since I don't need the timing info that ruby provides, I think thisslowdown is unjustified in my case!
I thought of running it with the PERFECT_MEMORY_SYSTEM=true and settingPERFECT_MEMORY_RESPONSE_LATENCY = 0, but then I figured out that willprobably break the transactional memory part of the memory system. WhenPERFECT_MEMORY_SYSTEM is true, ruby seems to completely bypass the cacheand simply return PERFECT_MEMORY_RESPONSE_LATENCY. That way, the xactionconflicts will never be detected. Can someone verify this?
The PERFECT_MEMORY_SYSTEM flags will indeed completely bypass LogTM.SPLASH would behave as an unsynchronized program. I would also expectRuby to behave in strange and unexpected ways if transactional binarieswere run with PERFECT_MEMORY_SYSTEM = true.
One alternative I thought of was using Tourmaline. While tourmalineworked for some small microbenchmarks, it always breaks when I'm tryingto run the SPLASH2 benchmarks.
The released controllers that attempt to allow transactional concurrencywere not very richly developed. They have a hard time handlingvirtualization events. However, the Serializer controller is fast andsimple, and much more robust. I don't know the specifics of yourrequirements, however... if you need non-transactional CPUs to be makingmeaningful requests then obviously Serializer is not an option.
Another option I tried is to let ruby do its thing but always return 0to simics for stall cycles. Basically, in ruby_operate(), I callmh_memorytracer_possible_cache_miss(mem_op) and then return 0. Thiswould slow the execution down somewhat but atleast it won't stallsimics. Conceptually this made sense to me but when I ran it, it gave methe following error right after the first xaction_begin:
simics-common: system/Sequencer.C:487: void Sequencer::makeRequest(constCacheMsg&): Assertion `isReady(request)' failed.
***  Simics getting shaky, switching to 'safe' mode.
***  Simics (main thread) received an abort signal, probably an assertion.
Ruby manipulates Simics's stall condition in two ways.
1) By returning non-zero values from mh_memorytracer_possible_cache_miss().
2) By calling SIMICS_stall_cycle(), usually to unstall a processor
I would expect the above behaviour to persist if most of Ruby thinks theprocessor is stalled when it is, in fact, not stalled. There are severalconditions that could be violated in isReady(), many of which are notrelated to LogTM.
Regardless, forbidding Ruby from stalling Simics will break LogTManyway, since LogTM relies first if stalling to prevent aborts, ratherthan aborting outright.
I understand there were some logTM bugs in this version of GEMS (1.3)which were fixed in the last release (1.4). Is this error being causedby one of those bugs? Is it worth the trouble to install GEMS 1.4 andtry this method out or is there something fundamentally wrong with whatI'm doing and won't work in 1.4 either?
I'm sure one of the LogTM architects will be glad to comment on this. I,for one, would reccomend the latest version, simply because its notalways straightforward to manually re-solve bugs in older versions of GEMS.
Any other ways of achieving whaty+simics (where ruby stalls simics)setup and I notic I'm trying to do?
If you're not interested in timing, try running with Ruby withSIMICS_RUBY_MULTIPLIER = 1, L1 latency 1, L2 latency 2, and MM latency3, link latencies small if needed, and make cache sizes huge (~GBs).Empirically, we know that a lot of the Simics+Ruby slowdown occursbecause Simics is spinning on stalled processors. Reducing all thelatencies should help substantially.Also, marginal increases in cpu-switch-time (eg from 1 to 5 or 10) wouldprobably speed things along somewhat, again at the expense of timingaccuracy.
Btw, I did try running barnes with the regular ruby+simics (where rubystalls simics) setup and I noticed ruby always returned 2000000000cycles as the stall cycle! What's causing this???
2 Billion is used as "an arbitrarily long stall time" -- Ruby neverreturns an exact number of cycles because Ruby will explicitly unstallSimics when the request has completed. One cannot know reliably atrequest-time the number of cycles that will be required for a givenrequest, hence Ruby simply stalls Simics (for 2 billion cycles), thenwhen the request has been satisfied (by Ruby's EventQueue.C and itsconsumers), Ruby calls SIMICS_stall_cycle() to unstall Simics.
I'd really appreciate any ideas.

Thanks in advance
shougata

_______________________________________________
Gems-users mailing list
Gems-users@xxxxxxxxxxx
https://lists.cs.wisc.edu/mailman/listinfo/gems-users
Use Google to search the GEMS Users mailing list by adding "site:https://lists.cs.wisc.edu/archive/gems-users/"; to your search.
_______________________________________________
Gems-users mailing list
Gems-users@xxxxxxxxxxx
https://lists.cs.wisc.edu/mailman/listinfo/gems-users
Use Google to search the GEMS Users mailing list by adding "site:https://lists.cs.wisc.edu/archive/gems-users/"; to your search.

[← Prev in Thread]	Current Thread	[Next in Thread→]
[Gems-users] Running SPLASH2 benchmarks with LogTM, Shougata Ghosh [Gems-users] Using Ruby parameters on SLICC code?, Enrique Vallejo Gutierrez Re: [Gems-users] Using Ruby parameters on SLICC code? -- Repeated question, ignore, Enrique Vallejo Gutierrez Re: [Gems-users] Using Ruby parameters on SLICC code? -- Repeated question, ignore, hongxia sun Re: [Gems-users] Using Ruby parameters on SLICC code?, Lei Yang Re: [Gems-users] Using Ruby parameters on SLICC code?, Liqun Cheng Re: [Gems-users] Using Ruby parameters on SLICC code?, Enrique Vallejo Gutierrez Re: [Gems-users] Running SPLASH2 benchmarks with LogTM, Dan Gibson Re: [Gems-users] Running SPLASH2 benchmarks with LogTM, Kevin Moore <= <Possible follow-up(s)> Re: [Gems-users] Running SPLASH2 benchmarks with LogTM, Shougata Ghosh Re: [Gems-users] Running SPLASH2 benchmarks with LogTM, Kevin Moore Re: [Gems-users] Running SPLASH2 benchmarks with LogTM, Shougata Ghosh

Previous by Date:	Re: [Gems-users] Using Ruby parameters on SLICC code?, Enrique Vallejo Gutierrez
Next by Date:	[Gems-users] Network Messages from Gems, Niket Agarwal
Previous by Thread:	Re: [Gems-users] Running SPLASH2 benchmarks with LogTM, Dan Gibson
Next by Thread:	Re: [Gems-users] Running SPLASH2 benchmarks with LogTM, Shougata Ghosh
Indexes:	[Date] [Thread]

Mailing List Archives

Authenticated access

Re: [Gems-users] Running SPLASH2 benchmarks with LogTM