damooo / vv-samsadhani

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

The distribution contains various tools related to Sanskrit computations developed under the guidance of Amba Kulkarni since 2002.

Pre-requisites:
apache HTTP server
bash
bison
flex
graphviz
gcc
g++
lttoolbox
make
perl
python
xsltproc
default-jdk

AND

Ocaml, Ocamlp4 patch

-------------------------------------------------------------------
Installation commands for Pre-requisites are as under:

sudo apt-get install apache2 bash bison flex graphviz gcc lttoolbox make perl python xsltproc default-jdk g++

#Install Objective Caml (ocaml-4.07.1.tar.gz)
available at http://ocaml.org/releases/
./configure
make world.opt
sudo make install

#Install package Ocamlbuild from (ocamlbuild-0.13.0.tar.gz)
available at https://github.com/ocaml/ocamlbuild/releases
make configure #% beware: not ./configure
make
sudo make install

#Then install Camlp4 (camlp4-4.07-1.tar.gz)
https://github.com/ocaml/camlp4/releases
./configure
make all # beware: not just make
sudo make install

#Install Zen (zen-master.tar.gz)
https://gitlab.inria.fr/huet/Zen.git
cd Zen/ML 
make

sudo a2enmod cgid
sudo systemctl restart apache2

-------------------------------------------------------------------

The distribution has following modules bundled together
   Morph analyser, Morph generator, Sandhi joiner, Sandhi splitter, Anusaaraka Skt-Hnd MT system, transliteration modules, Amarakosha, Ashtadhyayi Simulator, Navya-Nyaya Diagramatic Representation, Compound word generator, and e-readers for Sankshepa Ramayanam and SrimadBhagvadGita

To install Sanskrit Computational Linguistics tools

a) git pull http://scl.samsaadhanii.in:3000/amba/scl.git

b) cp appropriate SPEC/spec*.txt file to scl directory. Name it as spec.txt
   Check that the paths are correct, else make necessary changes.

c) ./configure 

d) make

e) sudo make install

All the packages are available under GPL. You would have received a copy of GPL license with this package.

In case of any queries, please contact ambapradeep@gmail.com.


-- Amba Kulkarni
10th Sept 2018
-------------------------------------------------------------------------------

History:
We acknowledge the help of ASR Melkote who had given their resources of Morphological Analyser in 2002. This formed a starting point for us. 

Mr. Jain worked on the Sanskrit morphological analyser from 2002-2003 towards his M.Tech. thesis at IIIT-H.

Ms. Sheeba worked as a part of work on her Ph.D. thesis contributed towards the development of morphological analyser from 2004-2006. Her major contribution was for subantas and kridantas.

Dr. Anil Kumar contributed for the development of tinganta analyser between 2006-2007, especially with the Dhaturatnakar entries.

Later from 2004-2006, various students at the Rashtriya Sanskrit Vidyapeetham Tirupati contributed to the development of Sandhi package. Contribution of Ms. Sivaja Nair, Pankaj Vyasa and Ms. Sushama Vempati deserve special mention.

University of Hyderabad later supported further development under the University of Potential Excellence scheme from 2006-2007.

During 2006-2008 Pawan Goyal, IIT Kanpur worked with Amba Kulkarni towards the development of Ashtadhyayi simulator.

Though Amba Kulkarni worked on various modules at her own pace, the project got a boost when the Technology Development for Indian Languages(TDIL)  division of Ministry of Information and Communication Technology supported the activity in the form of a Consortium of 7 Institutes (2009-13).

The Principal Investigators at the 7 institutes are:
Amba Kulkarni, Department of Sanskrit Studies, University of Hyderabad (Consortium Leader)
Dipti Mishra Sharma, IIIT-H, Hyderabad
Girish Nath Jha, Special Center for Sanskrit, JNU, Delhi
Veeranarayan Pandurangi, JRRSU, Jaipur
Tirumala Kulkarni, PPVP, Bangalore
S. S. Murty, RSVP, Tirupati
Shrinivas Varkhedi, Director, Sanskrit Academy, Hyderabad

Under this project on 'Development of Sanskrit Computational tools and Sanskrit-Hindi Machine Translation system', following tools have been developed:

a) Morph analyser
b) Morph generator
c) Sandhi
d) Sandhi Splitter
e) Sanskrit-Hindi Machine Translation system (Sampark and Anusaaraka models)
f) Compound Processor

All these modules were developed at the Department of Sanskrit Studies, University of Hyderabad.

Various consortium members have contributed by developing annotated tests for building these modules. In addition JNU developed a POS tagger and IIIT-H deveoped a POS tagger and a parser, which are not part of this distribution.

During 2015-17 Amba Kulkarni was awarded a fellowship at Indian Institute of Advanced Study, Shimla. During this period she improved the parsing algorithms taking into account the yogyataa as a constraint.

Following persons had major contribution in the development of the tools:
a) Dr. Sheeba
b) Dr. Devanand Shukl
c) Dr. Anil Kumar
d) Ms. Bhavani
e) Ms. Gauri
f) Ms. Kiranmayi
g) Mr. Karunakar
h) Dr. Shivaja
i) Dr. Shailaja
j) Dr. Pavankumar Satuluri
k) Dr. Monali Das
l) Dr. Arjun K
m) Dr. Preeti Shukla

In addition Converters and Transliteration modules for converting/transliterating from one scheme to the other are developed. Following schemes have been addressed.
a) Unicode Devanagari (UTF-8)
b) WX
c) Velthuis
d) Itrans
e) SLP
f) Kyoto Harvard

Dr. Sivaja Nair worked on her Ph.D. thesis on the Amarakosha from 2007-2011. The package she developed in the process is also available for distribution.

Dr. Anil Kumar developed the Compound processor as a part of his PhD thesis from 2008-2011.

Dr. Shailaja developed the concordance of three Paninian Dhatuvrttis as a part of her PhD thesis from 2009-13.

Dr. Pavankumar Satuluri developed the compound generator as a part of his PhD thesis (2011-15).

Dr. Arjuna developed the Nyaayacitradiipikaa, an analyser for Navya Nyaya Expressions as a prt of his PhD thesis (2013-17).

Dr. Monali Das provided the Kaaraka analysis of Sankshepa Ramayanam.

Dr. Preeti Patel (Shukl) developed the e-reader for SrimadBhagvadGeeta.

Dr. Anupama Ryali developed the e-reader for Shishupalavadham as a part of her Post Doctoral Fellowship.

Since 2007 Amba Kulkarni is also collaborating with GĂ©rard Huet, INRIA. As a result of this collaboration, an inter-communication between the Sanskrit Heritage tools and the Anusaaraka tools has been possible.

Finally I would like to acknowledge Prof. K V Ramkrishnamacharyulu for his guidance throughout the development of these tools.

-------------------------------------------------------------------------------

About

License:GNU General Public License v2.0


Languages

Language:HTML 97.6%Language:Perl 0.6%Language:OCaml 0.4%Language:TeX 0.4%Language:JavaScript 0.2%Language:Lex 0.2%Language:C 0.2%Language:Java 0.1%Language:Perl 6 0.1%Language:Shell 0.1%Language:CSS 0.0%Language:Makefile 0.0%Language:Python 0.0%Language:C++ 0.0%Language:Yacc 0.0%Language:XSLT 0.0%Language:CLIPS 0.0%