May 2009
3 posts
A list of things that are about parallel matrix...
I want a thing that stores sparse matrices over N computers and I can do math on those matrices.
My Dream API:
matrix = new(matrix_name, cols, rows)
matrix.put(col, row, value) # (or matrix.put_col(row, data) , matrix.put_row(col, data) )
value = matrix.get(col, row)
new_matrix = matrix.multiply(matrix_B)
matrix.transpose()
new_matrix = matrix.invert(iterations=0)
[U, S, V] =...
MPI in python on EC2 talk →
About to tread back into these waters. It’s been 4 years since I last ran anything in MPI, LAM or SCALAPACK. A bit nervous.
Why is NLTK so slow people
AMZN small instance (snail style)
### Took 92.35s to parse 10005 words 351 sentences (76.64% passed.) 0.26s per sentence.
Mac Pro
### Took 26.47s to parse 10005 words 351 sentences (76.64% passed.) 0.08s per sentence.
Where “parse” is pos_tag, an NP chunker (RegexpParser w/ our own grammar.) Most of the work is in pos_tag.