We did see an error with cuInit, but after restarting condor things showed up properly. Is there a way to have condor check these things from time to time?
You could configure HTCondor to check (using a startd cron job), but as far as I know there's no built-in support. The bigger problem is that, AFAIK, you have to restart the startd to add GPUs.
-- ToddM
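
A rough sketch of what such a startd cron probe could look like, in case it helps. This is an assumption-laden example, not a tested HTCondor recipe: the attribute names GPUProbeOK and GPUProbeCount are made up for illustration, and it assumes nvidia-smi is on the PATH of the execute node. It runs "nvidia-smi -L", counts the devices listed, and prints the result as "Attribute = Value" lines on stdout, which is the form a startd cron job hands back to the startd. The script would be registered with the STARTD_CRON_JOBLIST, STARTD_CRON_<JobName>_EXECUTABLE and STARTD_CRON_<JobName>_PERIOD knobs; check the exact settings against the manual for your version.

```python
#!/usr/bin/env python3
"""Hypothetical startd cron probe: report how many GPUs nvidia-smi can see."""
import subprocess
import sys


def count_gpus(listing: str) -> int:
    """Count device lines in `nvidia-smi -L` output ('GPU 0: <name> (UUID: ...)')."""
    return sum(1 for line in listing.splitlines() if line.startswith("GPU "))


def format_ad(gpu_count: int, probe_ok: bool) -> str:
    """Render the result as 'Attribute = Value' lines.

    The attribute names are illustrative assumptions, not HTCondor built-ins.
    """
    return (
        f"GPUProbeOK = {'True' if probe_ok else 'False'}\n"
        f"GPUProbeCount = {gpu_count}\n"
    )


def probe() -> str:
    """Run nvidia-smi and describe what it found; report a failed probe on any error."""
    try:
        result = subprocess.run(
            ["nvidia-smi", "-L"],
            capture_output=True,
            text=True,
            timeout=30,
            check=True,
        )
    except (OSError, subprocess.SubprocessError):
        # Missing binary, non-zero exit status, or timeout.
        return format_ad(0, False)
    return format_ad(count_gpus(result.stdout), True)


if __name__ == "__main__":
    sys.stdout.write(probe())
```

Note that this only makes a missing GPU visible in the machine ad (so you can spot it or alert on it). It does not work around the restart issue mentioned above: the startd would still need to be restarted before newly working GPUs are added.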