Dont understand why you want it in big chunks, but guess you have your reasons. It complicates things.
Using a 'wait' that way in a functions is impossible. 'Function.param(n)' has a value and a meaning only inside the function. Postpone an action with 4 seconds, brings that action way out of that function, hence, the meaning for 'Function.param(n)' is gone in the wind.
O would say, bring in an object, call it clock. Attach the timer behaviour to it. Now in the function, after the seek, start a timer with time param(1) and a timer tag. Use a new condition 'on timer' and stop the sound in it.
But, that is a lot of work. That will be a timer for each sample.
Other ways i can think of. If the samples are pieces definable in round seconds (should be possible because you have silence between them), then you can pre compose the samples in an array.
Check the array in a 'every 1 second', and make decession to stop/play something according the array.