Multilingual Pulse
This section describes the multilingual features of Pulse and gives a brief explanation on how to use the sentimentAnalysis() functions for different supported languages.
Pulse can analyze text in different languages. Currently English and Spanish are supported. You can specify the language that is analyzed in three ways:
- Provide the language as argument: if there is a language specified in the document record, then it can be used for analyzing the text by passing it as argument. This is particularly useful when a dataset contains texts in different languages. If the language in a record is not a supported one, then it is ignored.
-
Provide the language as parameter: if there is no value specified for the language for a document record, Pulse uses the value specified for the language parameter in the query to get the language.
Note: If you provide the language parameter more than once, then the last value specified is used.
- Do not provide an argument or parameter and use the default language. If the language is neither specified in the record nor by the user, then Pulse defaults to English unless you have changed the default language. To change the default language use the SetDefaultLanguage function.
Note: If you provide both an argument and a parameter, then the argument is used as the language. If the argument is not valid then the parameter is used. If neither the argument or parameter are valid then the default language is used.
Note: Accents are removed from characters in attributes. Additionally, a "u" with a dieresis is converted to a plain "u" and an "n" with a diacritical tilde is replace with a plain "n".
Functions that use language as parameter and/or as argument:
Other functions can use the language only as a parameter (if not provided, the function uses the default language):
- GetLoadedDictionary
- GetLoadedMapping
- LoadDictionary
- LoadMapping
- GetAllDictionaryWords
- GetAllMappingWords