Natural Language Processing Engineer

  Home  Education and Science  Natural Language Processing Engineer


“Natural Language Processing Engineer related Frequently Asked Questions by expert members with job experience as Natural Language Processing Engineer. These questions and answers will help you strengthen your technical skills, prepare for the new job interview and quickly revise your concepts”



78 Natural Language Processing Engineer Questions And Answers

41⟩ Polysemy is defined as the coexistence of multiple meanings for a word or phrase in a text object. Which of the following models is likely the best choice to correct this problem? A) Random Forest Classifier B) Convolutional Neural Networks C) Gradient Boosting D) All of these

B) Convolutional Neural Networks

CNNs are popular choice for text classification problems because they take into consideration left and right contexts of the words as features which can solve the problem of polysemy

 205 views

42⟩ While working with context extraction from a text data, you encountered two different sentences The tank is full of soldiers. The tank is full of nitrogen. Which of the following measures can be used to remove the problem of word sense disambiguation in the sentences? A) Compare the dictionary definition of an ambiguous word with the terms contained in its neighborhood B) Co-reference resolution in which one resolute the meaning of ambiguous word with the proper noun present in the previous sentence C) Use dependency parsing of sentence to understand the meanings

A) Compare the dictionary definition of an ambiguous word with the terms contained in its neighborhood

Option 1 is called Lesk algorithm, used for word sense disambiguation, rest others cannot be used.

 192 views

43⟩ Which of the following statement is(are) true for Word2Vec model? A) The architecture of word2vec consists of only two layers – continuous bag of words and skip-gram model B) Continuous bag of word is a shallow neural network model C) Skip-gram is a deep neural network model D) Both CBOW and Skip-gram are deep neural network models E) All of the above

C) Skip-gram is a deep neural network model

Word2vec contains the Continuous bag of words and skip-gram models, which are deep neural nets.

 225 views

45⟩ Which of the following regular expression can be used to identify date(s) present in the text object “The next meetup on data science will be held on 2017-09-21, previously it happened on 31/03, 2016” A) d{4}-d{2}-d{2} B) (19|20)d{2}-(0[1-9]|1[0-2])-[0-2][1-9] C) (19|20)d{2}-(0[1-9]|1[0-2])-([0-2][1-9]|3[0-1]) D) None of the above

D) None of the above

None if these expressions would be able to identify the dates in this text object.

 199 views

46⟩ While working with text data obtained from news sentences, which are structured in nature, which of the grammar-based text parsing techniques can be used for noun phrase detection, verb phrase detection, subject detection and object detection. A) Part of speech tagging B) Dependency Parsing and Constituency Parsing C) Skip Gram and N-Gram extraction D) Continuous Bag of Words

B) Dependency Parsing and Constituency Parsing

Dependency and constituent parsing extract these relations from the text

 201 views

47⟩ Explain me what is the general principle of an ensemble method and what is bagging and boosting in ensemble method?

The general principle of an ensemble method is to combine the predictions of several models built with a given learning algorithm in order to improve robustness over a single model. Bagging is a method in ensemble for improving unstable estimation or classification schemes. While boosting method are used sequentially to reduce the bias of the combined model. Boosting and Bagging both can reduce errors by reducing the variance term.

 195 views

48⟩ What percentage of the total statements are correct with regards to Topic Modeling? It is a supervised learning technique LDA (Linear Discriminant Analysis) can be used to perform topic modeling Selection of number of topics in a model does not depend on the size of data Number of topic terms are directly proportional to size of the data A) 0 B) 25 C) 50 D) 75 E) 100

A) 0

LDA is unsupervised learning model, LDA is latent Dirichlet allocation, not Linear discriminant analysis. Selection of the number of topics is directly proportional to the size of the data, while number of topic terms is not directly proportional to the size of the data. Hence none of the statements are correct.

 188 views

50⟩ Do you know the two components of Bayesian logic program?

Bayesian logic program consists of two components. The first component is a logical one ; it consists of a set of Bayesian Clauses, which captures the qualitative structure of the domain. The second component is a quantitative one, it encodes the quantitative information about the domain.

 187 views

55⟩ Do you know what is PAC Learning?

PAC (Probably Approximately Correct) learning is a learning framework that has been introduced to analyze learning algorithms and their statistical efficiency.

 182 views

58⟩ Which of the following models can be used for the purpose of document similarity? A) Training a word 2 vector model on the corpus that learns context present in the document B) Training a bag of words model that learns occurrence of words in the document C) Creating a document-term matrix and using cosine similarity for each document D) All of the above

D) All of the above

word2vec model can be used for measuring document similarity based on context. Bag Of Words and document term matrix can be used for measuring similarity based on terms.

 217 views