H-Buddhism serves as a medium for the exchange of information regarding academic resources, new research projects, scholarly publications, university job listings, and so forth, for specialists in Buddhist Studies who are currently affiliated with academic institutions.

Recent Content

Re: CALL FOR DATA> Vectorization Analysis of Taishō (Fritz Kemler)

Thank you very much for your interest

The vectorization provides the means to measure similarity/distance between pairs of texts. The distance metric in turn provides the base for hierarchical clustering - grouping. Strong similarity might imply identical authorship or influencing.

This method could be used to confirm relationships established by traditional philological means. On the other hand unexpected clustering results could provide hints where to look next with conventional research.

CFP> SEECHAC Colloquium Heidelberg 16-18 November 2015: Deadline for abstracts approaching

Dear Colleagues,
 
the deadline for abstracts for the Fourth International SEECHAC
Colloquium (Heidelberg 16-18 Nov 2015) «Religious Revivals and Artistic
Renaissance in Central Asia and the Himalayan Region – past and
present» is approaching. I apologize for the noise if this message

NEW BOOK> Tibetan Buddhism in Diaspora: Cultural re-signification in practice and institutions

Dear colleagues,

 

I have the pleasure to announce the publication of my book, Tibetan Buddhism in Diaspora: Cultural re-signification in practice and institutions (Routledge Critical Studies in Buddhism).

 

http://www.routledge.com/books/details/9780415719117/

 

From the editor’s description:

 

Re: CALL FOR DATA> Vectorization Analysis of Taishō (Fritz Kemler)

Dear Dr. Kemler,

I was not at all aware of "TF-IDF" analysis, so I looked at the Wikipedia article on it. It seems very interesting. I also looked at the web page you mentioned in your posting. It also seems very interesting. But for us who have no ideas about these informatic technology, could you please elaborate a little further what we would be able to do with your tools?

Re: CALL FOR DATA> Vectorization Analysis of Taishō (Fritz Kemler)

Hallo Dr. Kemler,

Generally the CBETA version of the Taisho canon is perhaps better downloaded from github than from the DVD. An XML version containing the line numbers is here:
https://github.com/cbeta-org/xml-p5/tree/master/T

Jen-jou Hung at DILA maintains a more streamlined version of this:
https://github.com/ddbc/CBETA_TAFxml
This version basically preserves just the line numbers and the differences between prose and verse passages, which is relavant for some questions.

Pages