About
I am a post-doctoral researcher (Lehrstuhlmitarbeiter) in the Vasishth lab, Department of Linguistics, University of Potsdam.
Before joining Potsdam, I was a research lecturer at LTRC, IIIT-Hyderabad, where I completed my PhD in Computational Linguistics in June 2011.
My thesis work aimed at building a generic dependency parsing paradigm for morphologically rich, free word order languages (with a special focus on Indian languages such as Hindi, Bangla, and Telugu).
To this end, I proposed a generalized two-stage constraint-based hybrid parsing approach. I also applied some of the salient properties of this paradigm to data-driven dependency parsing.
I was also closely involved in the development of the Hindi/Urdu dependency treebank, which is part of a larger Hindi/Urdu treebanking project.
My broad research interests are natural language (NL) Parsing, NL Modeling, Dependency Grammar, and Cognitive Science.
Contact
University of Potsdam
Department of Linguistics, Haus 14
Karl-Liebknecht-Str. 24-25
D-14476 Potsdam, Germany
E-mail: husain AT uni-potsdam DOT de
Publications
Dependency Parsing
2011
- Linguistically Rich Graph Based Data Driven Parsing For Hindi. Samar Husain, Raghu Pujitha Gade and Rajeev Sangal. In Proceedings of IWPT2011 workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2011). Dublin, Ireland. [pdf]
- Clausal parsing helps data-driven dependency parsing: Experiments with Hindi. Samar Husain, Phani Gadde, Joakim Nivre and Rajeev Sangal. In Proceedings of IJCNLP 2011. [pdf]
- A Constraint Based Hybrid Dependency Parser for Telugu. Sruthilaya Reddy Kesidi, Prudhvi Kosaraju, Meher Vijay and Samar Husain. In Proceedings of the 12th CICLing, Tokyo, Japan. 2011. [pdf]
2010
- The ICON-2010 Tools Contest on Indian Language Dependency Parsing. Samar Husain, Prashanth Mannem, Bharat Ram Ambati, and Phani Gadde. In Proceedings of ICON-2010 Tools Contest on Indian Language Dependency Parsing. Kharagpur, India. [pdf] [slides]
- Experiments with MaltParser for parsing Indian Languages. Sudheer Kolachina, Prasanth Kolachina, Manish Agarwal and Samar Husain. In Proceedings of ICON-2010 Tools Contest on Indian Language Dependency Parsing. Kharagpur, India. [pdf]
- A Two Stage Constraint Based Hybrid Dependency Parser for Telugu. Sruthilaya Reddy Kesidi, Prudhvi Kosaraju, Meher Vijay and Samar Husain. In Proceedings of ICON-2010 Tools Contest on Indian Language Dependency Parsing. Kharagpur, India. [pdf]
- On the Role of Morphosyntactic Features in Hindi Dependency Parsing. Bharat Ram Ambati, Samar Husain, Joakim Nivre and Rajeev Sangal. In Proceedings of NAACL-HLT 2010 workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2010), Los Angeles, CA. [pdf] [slides]
- Two methods to incorporate 'local morphosyntactic' features in Hindi dependency parsing. Bharat Ram Ambati, Samar Husain, Sambhav Jain, Dipti Misra Sharma and Rajeev Sangal. In Proceedings of NAACL-HLT 2010 workshop on Statistical Parsing of Morphologically Rich Languages (SPMRL 2010), Los Angeles, CA. [pdf] [slides]
- Improving Data Driven Dependency Parsing using Clausal Information. Phani Gadde, Karan Jindal, Samar Husain, Dipti M Sharma, and Rajeev Sangal. In Proceedings of NAACL-HLT 2010, Los Angeles, CA. 2010. [pdf] [slides]
2009
- Dependency Parsers for Indian Languages. Samar Husain. In Proceedings of ICON09 NLP Tools Contest: Indian Language Dependency Parsing. Hyderabad, India. 2009. [pdf] [slides]
- Constraint Based Hybrid Approach to Parsing Indian Languages. Akshar Bharati, Samar Husain, Meher Vijay, Kalyan Deepak, Dipti Misra Sharma and Rajeev Sangal. In Proceedings of The 23rd Pacific Asia Conference on Language, Information and Computation (PACLIC 23). Hong Kong. 2009. [pdf]
- Effect of Minimal Semantics on Dependency Parsing. Bharat Ram Ambati, Pujitha Gade, Chaitanya GSK and Samar Husain. In RANLP09 student paper workshop. [pdf]
- Two stage constraint based hybrid approach to free word order language dependency parsing. Akshar Bharati, Samar Husain, Dipti Misra Sharma and Rajeev Sangal. In Proceedings of the 11th International Conference on Parsing Technologies (IWPT09). Paris. 2009. [pdf] [slides]
- A modular cascaded approach to complete parsing. Samar Husain, Phani Gadde, Bharat Ambati, Dipti Misra Sharma and Rajeev Sangal. In Proceedings of the COLIPS International Conference on Asian Language Processing 2009 (IALP). Singapore. 2009. [pdf]
2008
- Two semantic features make all the difference in Parsing accuracy. Akshar Bharati, Samar Husain, Bharat Ambati, Sambhav Jain, Dipti M Sharma and Rajeev Sangal. In Proceedings of the 6th International Conference on Natural Language Processing (ICON-08), CDAC Pune, India. 2008. [pdf] [slides]
- A Two-Stage Constraint Based Dependency Parser for Free Word Order Languages. Akshar Bharati, Samar Husain, Dipti Misra Sharma, and Rajeev Sangal. In Proceedings of the COLIPS International Conference on Asian Language Processing 2008 (IALP). Chiang Mai, Thailand. 2008. [pdf] [slides]
- A Graph Based Method for Building Multilingual Weakly Supervised Dependency Parsers. Jagadeesh Gorla, Anil Kumar Singh, Rajeev Sangal, Karthik Gali, Samar Husain and Sriram Venkatapathy. In Proceedings of the 6th International Conference on Natural Language Processing (GoTAL). Gothenburg, Sweden. 2008. [pdf]
Treebanking and Dependency Grammar
2012
- Analyzing parser errors to improve parsing accuracy and to inform treebanking decisions. Samar Husain and Bhasha Agarwal. In Proceedings of 10th International Workshop on Treebanks and Linguistic Theories (TLT10). Heidelberg, Germany. 2012. [pdf] [slides]
2011
- Error Detection for Treebank Validation. Bharat Ram Ambati, Rahul Agarwal, Mridul Gupta, Samar Husain and Dipti Misra Sharma. In Proceedings of IJCNLP 2011 Workshop on Asian Language Resources, Chiang Mai, Thailand. 2011. [pdf]
- Empty Categories in Hindi Dependency Treebank: Analysis and Recovery. Chaitanya GSK, Samar Husain and Prashanth Mannem. In Proceedings of 5th Linguistic Annotation Workshop (at ACL HLT 2011). Portland, Oregon. 2011. [pdf]
- A classification of dependencies in the Hindi/Urdu treebank. Ashwini Vaidya and Samar Husain. Abstract presentation at UMass Workshop on South Asian Syntax and Semantics, Amherst. 2011. [pdf] [slides]
2010
- Partial Parsing as a Method to Expedite Dependency Annotation of a Hindi Treebank. Mridul Gupta, Vineet Yadav, Samar Husain and Dipti Misra Sharma. In Proceedings of The 7th International Conference on Language Resources and Evaluation (LREC). Valletta, Malta. 2010. [pdf]
- A high recall error identification tool for Hindi treebank validation. Bharat Ram Ambati, Mridul Gupta, Samar Husain and Dipti Misra Sharma. In Proceedings of The 7th International Conference on Language Resources and Evaluation (LREC). Valletta, Malta. 2010. [pdf]
- Issues in analyzing Telugu sentences towards building a Telugu Treebank. Chaitanya Vempaty, Viswanath Naidu, Samar Husain, Ravi Kiran, Lakshmi Bai, Dipti M Sharma, and Rajeev Sangal. In Proceedings of CICLing-2010. Iași, Romania. 2010. [pdf]
2009
- A karaka-based dependency annotation scheme for English. Ashwini Vaidya, Samar Husain, Prashanth Mannem, Dipti Misra Sharma. In Proceedings of the CICLing-2009, Mexico City, Mexico. 2009. [pdf]
2008
- A Rule Based Approach for Automatic Annotation of a Hindi TreeBank. Mridul Gupta, Vineet Yadav, Samar Husain, Dipti M Sharma. In Proceedings of the 6th International Conference on Natural Language Processing (ICON-08), CDAC Pune, India. 2008. [pdf]
- Towards an Annotated Corpus of Discourse Relations in Hindi. Rashmi Prasad, Samar Husain, Dipti Misra Sharma and Aravind K. Joshi. In Proceedings of IJCNLP 2008 Workshop on Asian Language Resources, Hyderabad, India. 2008. [pdf] [slides]
- Dependency Annotation Scheme for Indian Languages. Rafiya Begum, Samar Husain, Arun Dhwaj, Dipti Misra Sharma, Lakshmi Bai and Rajeev Sangal. In Proceedings of The Third International Joint Conference on Natural Language Processing (IJCNLP). Hyderabad, India. 2008. [pdf]
Machine Translation
2009
- Using Levin's verb classification for preposition sense selection in English to Indian language MT. Samar Husain, Phani Chaitanya, Ganeshwar Rao Dulam, Tariq Khan, and Dipti M. Sharma. In Proceedings of the Conference on Language and Technology 2009 (CLT09), Lahore, Pakistan. 2009. [pdf]
2007
- Disambiguating Tense, Aspect and Modality Markers for Correcting Machine Translation Errors. Anil Kumar Singh, Samar Husain, Harshit Surana, Jagadeesh Gorla, Chinnappa Guggilla and Dipti Misra Sharma. In Proceedings of the Conference on Recent Advances in Natural Language Processing (RANLP). Borovets, Bulgaria. 2007. [pdf]
- Simple Preposition Correspondence: A problem in English to Indian language Machine Translation. Samar Husain, Dipti Misra Sharma and Manohar Reddy. In Proceedings of the 4th ACL-SIGSEM Workshop on Prepositions, ACL. Prague, Czech Republic. 2007. [pdf]
- Exploring Translation Similarities for Building a Better Sentence Aligner. Anil Kumar Singh and Samar Husain. In Proceedings of the 3rd Indian International Conference on Artificial Intelligence. Pune, India. 2007. [pdf]
2005
- Comparison, Selection and Use of Sentence Alignment Algorithms for New Language Pairs. Anil Kumar Singh and Samar Husain. In Proceedings of the ACL-05: Association for Computational Linguistics Workshop on Building and Using Parallel Texts, 29-30 June 2005, Ann Arbor, USA. [pdf]
Others
2011
- Identification of Conjunct Verbs in Hindi and Its Effect on Parsing Accuracy. Rafiya Begum, Karan Jindal, Ashish Jain, Samar Husain and Dipti Misra Sharma. In Proceedings of the 12th CICLing, Tokyo, Japan. 2011. [pdf]
2010
- Grammar Extraction from Treebanks for Hindi and Telugu. Prasanth Kolachina, Sudheer Kolachina, Anil Kumar Singh, Viswanath Naidu, Samar Husain, Rajeev Sangal and Akshar Bharati. In Proceedings of The 7th International Conference on Language Resources and Evaluation (LREC). Valletta, Malta. 2010. [pdf]
2008
- Developing Verb Frames in Hindi. Rafiya Begum, Samar Husain, Dipti Misra Sharma and Lakshmi Bai. In Proceedings of The Sixth International Conference on Language Resources and Evaluation (LREC). Marrakech, Morocco. 2008. [pdf]
Teaching
At Potsdam
Computational tools for psycholinguistics, Winter, 2011 [Course Plan] [Moodle course page]
Psycholinguistic theories of parsing, Winter, 2011 [Course Plan] [Moodle course page]
At IIIT-Hyderabad
Computational Linguistics (HS5090), Monsoon, 2010 (with Dr. Soma Paul)
Introduction to Natural Language Processing (CS4725), Monsoon, 2010 (with Dr. V. Sriram)
NLP Applications (CS5728), Spring, 2010 (with Dr. V. Sriram)
Computational Linguistics (HS5090), Monsoon, 2009 (with Dr. Soma Paul, Dr. Dipti Sharma)
Introduction to Natural Language Processing (CS4725), Monsoon, 2009 (with Dr. V. Sriram)
Introduction to Natural Language Processing (CS4725), Monsoon, 2008 (with Prof. Rajeev Sangal)
Computer and Scripting II (CS5003), Spring, 2006-07
Computer and Scripting I (CS5002), Monsoon, 2006-07
Talks, etc.
Talks
- Computerlinguistisches Kolloquium 2011, Dept. of Linguistics, University of Potsdam, Germany.
- South Asian Languages: Formal Approaches and Computational Resources, Linguistic Institute 2011, Boulder, Colorado. [slides]
- Dependency Parsing Workshop 2009, Boulder, Colorado, June 2009. [slides]
- Pre-conference tutorial on Treebanking. ICON-08, Pune, India.
- CGMIL, 2008, Hyderabad [slides]
- IIIT-Hyderabad Advanced School on Natural Language Processing, May 26th - June 9th, Hyderabad, India, Summer 2008
- TCS NLP Winter School 2008, 24 December, 2007 - 7 January, 2008. [slides]
Organizer
- 3rd Advanced School on Natural Language Processing, May 23rd - June 5th, 2011, IIIT, Hyderabad, India.
- 2nd tools contest on IL dependency parsing, ICON10, Kharagpur, India
- Tools contest on IL dependency parsing, ICON09, Hyderabad, India
- Workshop on Computational Grammatical Models for Indian Languages, June 9 - 11, 2008, IIIT, Hyderabad, India.
Last updated: 29 November, 2011