You are here

HLT Phonetic Scorer

This package provides a utility to compute phonetic features (i.e., rhyme, alliteration, plosive, homogeneity) of tokenized sentences. 

Requirement

Java version >= 1.8

Usage

java -jar eu.fbk.hlt.phonetics.PhoneticScorer.jar [args]

where args can be:

i) -f <inputFile> <outputFile>
For each line in the input file, it calculates the phonetic scores and writes in the output file as tab-separated (Text\tRhyme Score\tAlliteration Score\tPlosive Score\tHomogeneity Score\n).

ii) -s <input string (with quotations)>
For the input string, it outputs the phonetic scores.

iii) -i
Interactive mode. It consumes standard input one line at a time and outputs the phonetic scores for each line. Please note that in all three cases, the program expects already tokenized text in which tokens are space separated. If your text is not tokenized, please consider tokenizing them before providing it to the phonetic scorer.

Please refer to the examples directory and the java doc for usage examples.

Terms of Use

The package includes two external resources (i.e. CMU pronunciation dictionary and Variant Conversion Info (VarCon) lexicon), both of which are free to use for non-commercial applications. The respective licences are available in the src/resources/lexical directory.

The phonetic scorers are described and used in the following paper:

@InProceedings{ozbal-pighin-strapparava:2013:ACL2013,
author = {\"{O}zbal, G\"{o}zde and Pighin, Daniele and Strapparava, Carlo},
title = {BRAINSUP: Brainstorming Support for Creative Sentence Generation},
booktitle = {Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)},
month = {August},
year = {2013},
address = {Sofia, Bulgaria},
publisher = {Association for Computational Linguistics},
pages = {1446--1455},
url = {http://www.aclweb.org/anthology/P13-1142}
}

If you use this package in your research, please cite this paper.

The package is free to use for non-commercial uses.

A research licence is granted through the following online form to scholars working for academic and research institutions. Please give us clear evidence about your affiliation (for instance the e-mail account and home-page).

If you are an undergraduate or master student the licence should be submitted by a professor of your University.

Agreement terms:

LICENCE AGREEMENT

FONDAZIONE BRUNO KESSLER (FBK)

HLT Phonetic Scorer (Tool)

1) Grant:

The Fondazione Bruno Kessler (FBK) hereby grants to you a non-exclusive, non-transferable, perpetual, royalty-free and worldwide license (the “License”) to use HLT Phonetic Scorer (the “Scorer”) solely for educational and research purposes, in accordance with Paragraph 2 below and subject to the terms and conditions of this License Agreement (the “Agreement”).

2) Limitations on Use:

The License is limited to non-commercial use. Non-commercial use relates only to educational and research purposes. Any other use is commercial
use. You may not use the Scorer in connection with any business activities. You may distribute and/or allow others to use a) the Scorer or b) the applications you create with the Scorer only if each new user is bound by the provisions of this Agreement.

3) Copies:

You may copy FBK material only as reasonably necessary for your licensed use.

4) Ownership:

The scorer and the accompanying documentation are licensed, not sold, to you. The  scorer is a proprietary product of FBK. FBK retains all rights not specifically granted to you hereunder, including ownership of the scorer and all copyrights, trade secrets, or other intellectual property rights in the Scorer and any accompanying information.

5) Publication Credit:

You agree to acknowledge FBK (“FBK”, Fondazione Bruno Kessler) with appropriate citations in any publication or presentation containing research results obtained in whole or in part through the use of the Scorer. The following publication has to be cited:

Gözde Özbal, Daniele Pighin, and Carlo Strapparava. 2013. BRAINSUP: Brainstorming Support for Creative Sentence Generation. In Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics (ACL 2013), pages 1446–1455, Sofia, Bulgaria, August. Association for Computational Linguistics.

6) Term of License:

The License is effective upon receipt by you of the scorer and shall continue until terminated. The License will terminate immediately without notice by FBK if you fail to comply with the terms and conditions of this Agreement. Upon termination of this License, you shall immediately discontinue all use of the Scorer provided hereunder, and return to FBK or destroy the original and all copies of all such Scorer. All of your obligations under this Agreement shall survive the termination of the License.

7) Warranty:
FBK MAKES NO REPRESENTATIONS ABOUT THE SUITABILITY, USE, OR PERFORMANCE OF THIS DATABASE OR ABOUT ANY CONTENT OR INFORMATION MADE ACCESSIBLE BY THE DATABASE, FOR ANY PURPOSE. THE DATABASE IS
PROVIDED “AS IS,” WITHOUT EXPRESS OR IMPLIED WARRANTIES INCLUDING, BUT NOT LIMITED TO, ANY IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE, OR NONINFRINGEMENT WITH RESPECT
TO THE DATABASE. FBK IS NOT OBLIGED TO SUPPORT OR ISSUE UPDATES TO THE DATABASE.

8) Limitation on Liability:

This scorer is provided free of charge and, accordingly, FBK shall not be liable under any theory for any damages suffered by you or any user of the Scorer. UNDER NO CIRCUMSTANCES SHALL FBK BE LIABLE TO YOU OR ANY OTHER PERSON FOR ANY DIRECT, INDIRECT, SPECIAL, INCIDENTAL, OR CONSEQUENTIAL DAMAGES OF ANY CHARACTER INCLUDING, WITHOUT LIMITATION, DAMAGES FOR LOSS OF GOODWILL, WORK STOPPAGE, COMPUTER FAILURE OR MALFUNCTION, OR ANY AND ALL OTHER ECONOMIC LOSS OR COMMERCIAL DAMAGES ARISING OUT OF THE USE OR INABILITY TO USE THIS DATABASE, EVEN IF FBK SHALL HAVE

BEEN INFORMED OF THE POSSIBILITY OF SUCH DAMAGES, OR FOR ANY THIRD-PARTY CLAIMS.

9) Indemnification:

You agree to hold harmless, indemnify, and defend FBK, its Trustees, officers, employees, and agents from and against any loss, damage, liability, claim of loss, lawsuit, cause of
action, or other claim asserted against them or any of them arising out of, or in any way connected with, your performance of any activity hereunder.

10) Export Controls:

You agree that the scorer will not be shipped, transferred, or exported into any country or used in any manner prohibited by the Italian export laws, restrictions, or regulations.

11) Disputes/Arbitration:

This Agreement shall be governed under the laws of Italy. Any dispute between the parties arising out of or relating to this Agreement will be submitted to binding
arbitration in Trento, (Italy), pursuant to the Commercial Arbitration Rules of the Italian Arbitration Association, and judgment on the award may be entered in any court of competent
jurisdiction; provided, however, that either party may seek preliminary injunctive or other equitable relief pending arbitration to prevent irreparable harm. The prevailing party in
any arbitration or litigation shall be entitled to recover all reasonable expenses thereof, including attorney’s fees in connection with such proceedings or any appeal thereof.

12) Entire Agreement:
This Agreement contains the entire agreement between the parties with respect to the subject matter hereof, and it shall not be modified or amended except by an instrument in writing signed by both parties hereto.