Sinc Interpolation based on FFT

The example and statement that follows are extremely puzzling:

Why one time series built with with one impulse at t=0 and a total of 8 samples would be aliased? And if it was why would it be aliased only in position 5 of the FFT and not in additional positions?
Probably this is being taken out of its proper context and therefore would appreciate if you could elaborate on the rationale here. Thank you.