First, this is an important and basic result. Mildly rephrased, it says: if you have a finite linearly dependent set of vectors, then by looking from left to right you can always find one vector which lies in the span of the vectors to its left in your list. Compare this with the statement that a finite set of vectors is linearly dependent iff you can always find one vector in the list which is a linear combination of the other vectors. The latter statement is a more immediate consequence of the definition: if you have a nontrivial linear combination $a_1 v_1 + \ldots + a_n v_n = 0$, then by definition we must have $a_i \neq 0$ for at least one $i$, and then
$$v_i = \frac{-a_1}{a_i} v_1 + \ldots + \frac{-a_{i-1}}{a_i}v_{i-1} + \frac{-a_{i+1}}{a_i} v_{i+1} + \ldots + \frac{-a_n}{a_i} v_n.$$
Axler’s statement is stronger in that it involves the ordering of the vectors. This is surprisingly useful, and you should look ahead to see applications if you haven’t already.
About the $v_1$: it’s the first vector on the list, so that’s the distinguished role it plays. Note also that I would state the result a bit differently: I would allow $v_1$ to be a linear combination of the previous, empty, list of vectors, which happens iff $v_1 = 0$. But when thinking about linear dependence early on it is probably best to concentrate on the case when none of the vectors are zero, because putting the $0$ vector into the list automatically makes the list linear dependent and is a rather degenerate case.
For your other questions:
We pick the largest $j$ because this definition enables us to write $v_j$ as a linear combination of previous vectors in the list only, which is what we are trying to do.
Not all of the $a_2,\ldots,a_m$ can be zero: since we’ve assumed that $a_1,\ldots,a_m$ are not all zero, if all but $a_1$ are zero then we must have $a_1 \neq 0$. But then
$$0 = a_1 v_1 + a_2 v_2 + \ldots + a_m v_m = a_1 v_1,$$
and since $a_1 \neq 0$, we get $v_1 = 0$, and we’ve assumed it isn’t. (Note that you write “because $v_1 = 0$” where you should write “because $v_1 \neq 0$”.)