<?xml version="1.0" encoding="UTF-8"?><rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:wfw="http://wellformedweb.org/CommentAPI/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
><channel><title>nlp Archives - Francesco Lelli %</title> <atom:link href="https://francescolelli.info/tag/nlp/feed/" rel="self" type="application/rss+xml" /><link>https://francescolelli.info/tag/nlp/</link> <description>Information Management, Computer Science,  Economics, Finance and more</description> <lastBuildDate>Fri, 02 Dec 2022 12:48:00 +0000</lastBuildDate> <language>en-US</language> <sy:updatePeriod> hourly </sy:updatePeriod> <sy:updateFrequency> 1 </sy:updateFrequency> <generator>https://wordpress.org/?v=6.8.5</generator><image> <url>https://francescolelli.info/wp-content/uploads/2018/11/cropped-InstrumentElement-32x32.jpg</url><title>nlp Archives - Francesco Lelli %</title><link>https://francescolelli.info/tag/nlp/</link> <width>32</width> <height>32</height> </image> <site
xmlns="com-wordpress:feed-additions:1">156264324</site> <item><title>OpenAI: What is Service Oriented Architecture (SOA) ?</title><link>https://francescolelli.info/machine-learning/openai-what-is-service-oriented-architecture-soa/</link> <comments>https://francescolelli.info/machine-learning/openai-what-is-service-oriented-architecture-soa/#respond</comments> <dc:creator><![CDATA[Francesco Lelli]]></dc:creator> <pubDate>Fri, 02 Dec 2022 12:38:29 +0000</pubDate> <category><![CDATA[Machine Learning]]></category> <category><![CDATA[more]]></category> <category><![CDATA[AI]]></category> <category><![CDATA[AI Generated Text]]></category> <category><![CDATA[nlp]]></category> <category><![CDATA[OpenAI]]></category> <category><![CDATA[OpenAIChat]]></category> <category><![CDATA[Service Oriented Architecture]]></category> <category><![CDATA[SOA]]></category> <guid
isPermaLink="false">https://francescolelli.info/?p=2434</guid><description><![CDATA[<p>I just asked to Chat.openai what is Service Oriented Architecture (see picture below) and what follows is its opinion on the topic. Folks at OpenAI developed the ChatGPT model as a way to optimizing language models for dialogues. I found this idea quite challenging with a lot of potential. Therefore, I asked to Chat.OpenAI an [&#8230;]</p><p>The post <a
href="https://francescolelli.info/machine-learning/openai-what-is-service-oriented-architecture-soa/">OpenAI: What is Service Oriented Architecture (SOA) ?</a> appeared first on <a
href="https://francescolelli.info">Francesco Lelli</a>.</p> ]]></description> <content:encoded><![CDATA[<p>I just asked to <a
href="https://openai.com/blog/chatgpt/" target="_blank" rel="noreferrer noopener">Chat.openai</a> what is Service Oriented Architecture (see picture below) and what follows is its opinion on the topic.</p><figure
class="wp-block-image size-full"><img
fetchpriority="high" decoding="async" width="1024" height="502" data-attachment-id="152" data-permalink="https://francescolelli.info/service-oriented-architecture-master-class/attachment/service-oriented-architecture/" data-orig-file="https://francescolelli.info/wp-content/uploads/2018/12/Service-Oriented-Architecture.png" data-orig-size="1024,502" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="Service-Oriented-Architecture" data-image-description="&lt;p&gt;Service Oriented Architecture (SOA) refers to a set of techniques for enhancing and re-design the IT infrastructure of a companies in order to achieve more flexibility and re-usability  &lt;/p&gt;
" data-image-caption="&lt;p&gt;Service Oriented Architecture (SOA)&lt;/p&gt;
" data-medium-file="https://francescolelli.info/wp-content/uploads/2018/12/Service-Oriented-Architecture-300x147.png" data-large-file="https://francescolelli.info/wp-content/uploads/2018/12/Service-Oriented-Architecture.png" src="https://francescolelli.info/wp-content/uploads/2018/12/Service-Oriented-Architecture.png?8011c3&amp;8011c3" alt="Service Oriented Architecture (SOA)" class="wp-image-152" srcset="https://francescolelli.info/wp-content/uploads/2018/12/Service-Oriented-Architecture.png 1024w, https://francescolelli.info/wp-content/uploads/2018/12/Service-Oriented-Architecture-300x147.png 300w, https://francescolelli.info/wp-content/uploads/2018/12/Service-Oriented-Architecture-768x377.png 768w, https://francescolelli.info/wp-content/uploads/2018/12/Service-Oriented-Architecture-600x294.png 600w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption>Service Oriented Architecture (SOA)</figcaption></figure><p>Folks at OpenAI developed the ChatGPT model as a way to optimizing language models for dialogues. I found this idea quite challenging with a lot of potential. Therefore, I asked to Chat.OpenAI an open question that on a topic that I am familiar with. As ChatGPT is design for dialog I asked also a few sub-questions as well where I was searching for more details. What follows is the unedited answers:</p><h1 class="wp-block-heading">What is Service Oriented Architecture</h1><p>Service-oriented architecture (SOA) is a software design approach that focuses on building a network of independent, modular components that provide services to other components over a network in order to form a complete application. This architecture is designed to make it easier to develop, deploy, and manage complex applications by breaking them down into smaller, independent units of functionality that can be developed, tested, and deployed independently. SOA is often used in distributed systems and relies on communication protocols such as web services to enable components to interact with each other.</p><p>Some of the key features of service-oriented architecture (SOA) include:</p><ol
class="wp-block-list"><li><strong>Modularity</strong>: SOA is based on the idea of breaking down complex applications into smaller, independent components that can be developed, tested, and deployed independently.</li><li><strong>Independence</strong>: Each component in an SOA-based system is independent and can be managed and maintained separately from other components, which makes it easier to develop, deploy, and update applications.</li><li><strong>Standardized communication protocols</strong>: SOA relies on standardized communication protocols, such as web services, to enable components to interact with each other. This allows different components to be developed using different programming languages and technologies, as long as they can communicate using the same protocol.</li><li><strong>Reusability</strong>: Because components in an SOA-based system are independent and modular, they can be easily reused in other applications, which can save time and effort during development.</li><li><strong>Flexibility</strong>: SOA allows for flexibility by enabling components to be added, removed, or updated without disrupting the overall functionality of the application. This makes it easier to adapt to changing business requirements and technology needs.</li></ol><h2 class="wp-block-heading">Modularity</h2><p>Modularity is one of the key features of service-oriented architecture (SOA), and refers to the idea of breaking down complex applications into smaller, independent components that can be developed, tested, and deployed independently. This allows each component to be managed and maintained separately from other components, which makes it easier to develop, deploy, and update applications.</p><p>In SOA, each component is called a &#8220;service&#8221; and provides a specific function or piece of functionality. For example, a service might be responsible for handling user authentication, or for performing calculations, or for storing and retrieving data from a database. By designing an application as a network of interconnected services, developers can create more modular and scalable applications that are easier to maintain and update over time.</p><p>Modularity is an important concept in SOA because it enables developers to create applications that are flexible and adaptable to changing business requirements and technology needs. For example, if a new service is needed to support a new feature, it can be added to the application without disrupting the existing services or the overall functionality of the application. Similarly, if a service needs to be updated or replaced, it can be done without affecting the rest of the application. This makes it easier to evolve and improve an application over time without having to redesign the entire system.</p><h2 class="wp-block-heading">Independence</h2><p>Independence is another key feature of service-oriented architecture (SOA), and refers to the idea that each component in an SOA-based system is independent and can be managed and maintained separately from other components. This means that developers can work on different parts of the application without having to worry about how their changes will affect other parts of the system.</p><p>Independence is an important concept in SOA because it enables developers to create applications that are flexible and adaptable to changing business requirements and technology needs. For example, if a service needs to be updated or replaced, it can be done without affecting the rest of the application. This makes it easier to evolve and improve an application over time without having to redesign the entire system.</p><p>Additionally, because each service in an SOA-based system is independent, it can be developed using different programming languages and technologies, as long as it can communicate with other services using a standardized protocol. This allows developers to choose the best tools and technologies for each individual service, which can make the development process more efficient and effective.</p><h2 class="wp-block-heading">Standardized Communication Protocols</h2><p>Standardized communication protocols are an important aspect of service-oriented architecture (SOA), and refer to the use of specific rules and standards for enabling components to communicate with each other. In SOA, these protocols are used to enable services to send and receive messages and data to each other in order to perform their designated functions and contribute to the overall functionality of the application.</p><p>One of the most common protocols used in SOA is the web services protocol, which is a set of rules and standards for exchanging data between applications or services over a network. This protocol allows services to be developed using different programming languages and technologies, as long as they can communicate using the same protocol. This enables developers to create more modular and scalable applications that are easier to maintain and update over time.</p><p>In addition to the web services protocol, there are many other standardized communication protocols that are commonly used in SOA, such as REST, JSON, and XML. These protocols are designed to enable services to communicate with each other in a standardized and interoperable way, which makes it easier to develop and deploy complex applications.</p><p>Overall, the use of standardized communication protocols is an important aspect of SOA because it allows different components to be developed and maintained independently, while still being able to work together to form a complete application. This makes it easier to create flexible and adaptable applications that can evolve and improve over time.</p><h2 class="wp-block-heading">Reusability</h2><p>Reusability is a key benefit of service-oriented architecture (SOA), and refers to the ability to use components or services in multiple applications or contexts. In SOA, each component is called a &#8220;service&#8221; and provides a specific function or piece of functionality. By designing an application as a network of interconnected services, developers can create more modular and scalable applications that are easier to maintain and update over time.</p><p>One of the key advantages of this modular approach is that it allows services to be easily reused in other applications. For example, if a service is developed to handle user authentication, that service can be used in multiple applications without having to be redeveloped for each application. This can save time and effort during the development process, and can also help to ensure that services are well-tested and reliable.</p><p>Reusability is an important concept in SOA because it allows developers to create more flexible and adaptable applications that can evolve and improve over time. For example, if a new service is needed to support a new feature, it can be added to the application without disrupting the existing services or the overall functionality of the application. Similarly, if a service needs to be updated or replaced, it can be done without affecting the rest of the application.</p><p>Overall, the ability to reuse services in multiple applications is a key benefit of SOA, and can help to save time and effort during development, as well as improve the reliability and flexibility of applications.</p><h2 class="wp-block-heading">Flexibility</h2><p>Flexibility is one of the key advantages of service-oriented architecture (SOA), and refers to the ability to easily add, remove, or update components without disrupting the overall functionality of the application. In SOA, each component is called a &#8220;service&#8221; and provides a specific function or piece of functionality. By designing an application as a network of independent services, developers can create more modular and scalable applications that are easier to maintain and update over time.</p><p>One of the key benefits of this modular approach is that it allows developers to make changes to an application without having to redesign the entire system. For example, if a new service is needed to support a new feature, it can be added to the application without disrupting the existing services or the overall functionality of the application. Similarly, if a service needs to be updated or replaced, it can be done without affecting the rest of the application.</p><p>Flexibility is an important concept in SOA because it enables developers to create applications that are adaptable to changing business requirements and technology needs. For example, if the requirements for an application change over time, it is relatively easy to update or replace services as needed without having to redesign the entire application. This makes it easier to evolve and improve an application over time without having to start from scratch.</p><p>Overall, the ability to easily add, remove, or update components is a key advantage of SOA, and can help to improve the flexibility and adaptability of applications. This can make it easier to respond to changing business needs and technology trends, and can help to ensure that applications remain relevant and effective over time.</p><h2 class="wp-block-heading">Summary (Still According to Chat.OpenAI)</h2><p>Service-oriented architecture (SOA) is a software design approach that focuses on building a network of independent, modular components that provide services to other components over a network in order to form a complete application. This architecture is designed to make it easier to develop, deploy, and manage complex applications by breaking them down into smaller, independent units of functionality that can be developed, tested, and deployed independently. SOA is often used in distributed systems and relies on communication protocols such as web services to enable components to interact with each other.</p><p>Some of the key features of SOA include modularity, independence, and the use of standardized communication protocols. Modularity refers to the idea of breaking down complex applications into smaller, independent components that can be developed, tested, and deployed independently. Independence refers to the idea that each component in an SOA-based system is independent and can be managed and maintained separately from other components. Standardized communication protocols refer to the use of specific rules and standards for enabling components to communicate with each other.</p><p>Some of the key benefits of SOA include reusability, flexibility, and the ability to easily adapt to changing business requirements and technology needs. Reusability refers to the ability to use components or services in multiple applications or contexts. Flexibility refers to the ability to easily add, remove, or update components without disrupting the overall functionality of the application. The ability to adapt to changing business requirements and technology needs refers to the ability of SOA to enable applications to evolve and improve over time without having to redesign the entire system.</p><hr
/><p><em>Does #OpenAI ( #OpenAIChat ) knows what is Service Oriented Architecture? #SOA #SoftwareEngineering #API</em><br
/><a
href='https://twitter.com/intent/tweet?url=https%3A%2F%2Ffrancescolelli.info%2Fmachine-learning%2Fopenai-what-is-service-oriented-architecture-soa%2F&#038;text=Does%20%23OpenAI%20%28%20%23OpenAIChat%20%29%20knows%20what%20is%20Service%20Oriented%20Architecture%3F%20%23SOA%20%23SoftwareEngineering%20%23API&#038;related' target='_blank' rel="noopener noreferrer" >Share on X</a><br
/><hr
/><h2 class="wp-block-heading">Final Note (from the author) on OpenAI answering to What is Service Oriented Architecture</h2><p>If we exclude the picture, the opening text and this closing text, the content is entirely generated by Chat.OpenAI using ChatGPT. The content of the answer looks sound and consistent and, from time to time, a bit redundant. In addition, the information are clearly correct and touch relevant sub-topics. Finally, strictly talking of the deepness of the content, it is sufficiently informative for a non expert but a bit useless for a domain expert as it can be classified as &#8220;a good introduction&#8221;.  Is short? Does OpenAI knows what is Service Oriented Architecture? Well&#8230; the answer looks correct <img
src="https://s.w.org/images/core/emoji/16.0.1/72x72/1f642.png" alt="🙂" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p><p>If you are curious and you want to know more, <a
href="https://francescolelli.info/machine-learning/ai-assisted-writing-cartolibreria-biondi-adopts-a-researcher/" target="_blank" rel="noreferrer noopener">at this link you can find another example of AI assisted writing</a></p><p></p><p>The post <a
href="https://francescolelli.info/machine-learning/openai-what-is-service-oriented-architecture-soa/">OpenAI: What is Service Oriented Architecture (SOA) ?</a> appeared first on <a
href="https://francescolelli.info">Francesco Lelli</a>.</p> ]]></content:encoded> <wfw:commentRss>https://francescolelli.info/machine-learning/openai-what-is-service-oriented-architecture-soa/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <post-id
xmlns="com-wordpress:feed-additions:1">2434</post-id> </item> <item><title>On Knowledge Graph and Artificial Intelligence</title><link>https://francescolelli.info/big-data/on-knowledge-graph-and-artificial-intelligence/</link> <comments>https://francescolelli.info/big-data/on-knowledge-graph-and-artificial-intelligence/#respond</comments> <dc:creator><![CDATA[Francesco Lelli]]></dc:creator> <pubDate>Tue, 22 Feb 2022 19:37:44 +0000</pubDate> <category><![CDATA[Big Data]]></category> <category><![CDATA[Machine Learning]]></category> <category><![CDATA[Research]]></category> <category><![CDATA[Artificial Intelligence]]></category> <category><![CDATA[Knowledge Graph]]></category> <category><![CDATA[nlp]]></category> <category><![CDATA[RDF]]></category> <category><![CDATA[research]]></category> <category><![CDATA[schema.org]]></category> <category><![CDATA[Semantic web]]></category> <category><![CDATA[SPARQL]]></category> <category><![CDATA[wikidata]]></category> <guid
isPermaLink="false">https://francescolelli.info/?p=2400</guid><description><![CDATA[<p>In this article I will try to give an introduction to what is a knowledge graph and what is its relationship with artificial intelligence. In a nutshell, we can see this relationship as bi-directional. On the one hand knowledge graphs are ways to represent information and can be used as data for machine learning models. [&#8230;]</p><p>The post <a
href="https://francescolelli.info/big-data/on-knowledge-graph-and-artificial-intelligence/">On Knowledge Graph and Artificial Intelligence</a> appeared first on <a
href="https://francescolelli.info">Francesco Lelli</a>.</p> ]]></description> <content:encoded><![CDATA[<p>In this article I will try to give an introduction to what is a knowledge graph and what is its relationship with artificial intelligence. In a nutshell, we can see this relationship as bi-directional. On the one hand knowledge graphs are ways to represent information and can be used as data for machine learning models. On the other hand AI models can be use for enriching information that are represented using a knowledge graph.</p><p>A knowledge graph is also known as a semantic network and represents a network of real-world entities. In other words represent &#8220;reality&#8221; as a combination of objects, events, situations, or concepts taking into account the relationship between them. This information is usually stored in a graph database and visualized as a graph structure, prompting the term knowledge “graph.”. The figure below can give you a visual idea of how these entities interlink among each other.</p><figure
class="wp-block-image size-large"><img
decoding="async" width="1024" height="770" data-attachment-id="2407" data-permalink="https://francescolelli.info/big-data/on-knowledge-graph-and-artificial-intelligence/attachment/conceptual_diagram_-_example-2/" data-orig-file="https://francescolelli.info/wp-content/uploads/2022/02/Conceptual_Diagram_-_Example.jpg" data-orig-size="1280,963" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="Conceptual_Diagram_-_Example" data-image-description="&lt;p&gt;Knowledge Graph and Artificial Intelligence&lt;/p&gt;
" data-image-caption="&lt;p&gt;Knowledge Graph and Artificial Intelligence&lt;/p&gt;
" data-medium-file="https://francescolelli.info/wp-content/uploads/2022/02/Conceptual_Diagram_-_Example-300x226.jpg" data-large-file="https://francescolelli.info/wp-content/uploads/2022/02/Conceptual_Diagram_-_Example-1024x770.jpg" src="https://francescolelli.info/wp-content/uploads/2022/02/Conceptual_Diagram_-_Example-1024x770.jpg?8011c3&amp;8011c3" alt="Knowledge Graph and Artificial Intelligence" class="wp-image-2407" title="Knowledge Graph and Artificial Intelligence" srcset="https://francescolelli.info/wp-content/uploads/2022/02/Conceptual_Diagram_-_Example-1024x770.jpg 1024w, https://francescolelli.info/wp-content/uploads/2022/02/Conceptual_Diagram_-_Example-300x226.jpg 300w, https://francescolelli.info/wp-content/uploads/2022/02/Conceptual_Diagram_-_Example-768x578.jpg 768w, https://francescolelli.info/wp-content/uploads/2022/02/Conceptual_Diagram_-_Example-600x451.jpg 600w, https://francescolelli.info/wp-content/uploads/2022/02/Conceptual_Diagram_-_Example.jpg 1280w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure><p>Intuitively, we can note that knowledge graphs take advantages of triples: subject-verb-qualifier in order to infer relationships among the data.</p><p>The knowledge graphs is at the core of the semantic web vision and standards like RDF and query languages like SPARQL are maintained by the w3c. A set of pointers for more information is available in this paper that I wrote specifically for <a
href="https://francescolelli.info/publication/interoperability-of-the-time-of-industry-4-0-and-the-internet-of-things/" target="_blank" rel="noreferrer noopener">interoperability in industry 4.0 and internet of things</a>. However sections 2.3. (of the paper) give an overview of the semantic stack.</p><h3 class="wp-block-heading">Existing tools and technologies that use knowledge graph</h3><p>A core set of technologies that are currently in productions and used companies like Google, Facebook, Apple includes the following:</p><ul
class="wp-block-list"><li><a
href="https://www.wikidata.org/wiki/Wikidata:Main_Page" target="_blank" rel="noreferrer noopener nofollow"><strong>Wikidata</strong></a>: Wikidata is a collaboratively edited multilingual knowledge graph hosted by the Wikimedia Foundation. It is a free and open knowledge base that can be read and edited by both humans and machines.</li><li><strong><a
href="https://schema.org/" target="_blank" rel="noreferrer noopener nofollow">schema.org</a></strong> is an organization that publishes documentations and guidelines to using structured data mark-up on web-pages. Its main objective is to standardize HTML tags to be used by webmasters for creating rich results about a certain topic of interest.</li><li><a
href="https://developers.google.com/knowledge-graph" target="_blank" rel="noreferrer noopener nofollow"><strong>Google Knowledge Graph</strong></a> is a knowledge base used by Google and its services to enhance its search engine&#8217;s results with information gathered from a variety of sources.</li></ul><p>These technologies are strictly focusing on the data instead and the video below share a practical approach that focus on the data and how to combine them. Note that it does not consider any AI approach but focus on how to combine existing available data and their relationships.</p><figure
class="wp-block-embed-youtube wp-block-embed is-type-video is-provider-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div
class="wp-block-embed__wrapper"> <iframe
title="Strands of the modern semantic web: schema.org, Wikidata, and the Knowledge Graph" width="800" height="450" src="https://www.youtube.com/embed/UzyIlAXOAP8?feature=oembed&#038;enablejsapi=1&#038;origin=https://francescolelli.info" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe></div></figure><p>However, it contains very nice examples and give an intuitive idea on how Wikidata, Schema.org and Knowledge graph works.</p><h3 class="wp-block-heading">Research on Knowledge Graph and Artificial Intelligence</h3><p>Now that we have an idea of what are the basic techniques for representing information we need to understand how we can use machine learning techniques for enriching these information and the expressiveness of the models. What follows is a set of seminars that discuss the knowledge graph from a more theoretical point of view and consider open research questions. This is the first part:</p><figure
class="wp-block-embed-youtube wp-block-embed is-type-video is-provider-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div
class="wp-block-embed__wrapper"> <iframe
title="CS520: Knowledge Graph Seminar Session 1 (Spring 2020)" width="800" height="450" src="https://www.youtube.com/embed/bvwjG-3qAmY?feature=oembed&#038;enablejsapi=1&#038;origin=https://francescolelli.info" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe></div></figure><p>Below you can find the second part. It also contains very nice real life case studies. Including how Siri works!</p><figure
class="wp-block-embed-youtube wp-block-embed is-type-video is-provider-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div
class="wp-block-embed__wrapper"> <iframe
title="CS 520: Knowledge Graph Seminar Session 2 (Spring 2020)" width="800" height="450" src="https://www.youtube.com/embed/ZWM-Dlw3VCM?feature=oembed&#038;enablejsapi=1&#038;origin=https://francescolelli.info" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe></div></figure><p>If you are curious and would like to know more about the topic I encourage you in browsing the YouTube channel for more videos.</p><hr
/><p><em>On #Knowledge Graph and #ArtificialIntelligence : how to combine data and models for better information. #BigData #DataScience</em><br
/><a
href='https://twitter.com/intent/tweet?url=https%3A%2F%2Ffrancescolelli.info%2Fbig-data%2Fon-knowledge-graph-and-artificial-intelligence%2F&#038;text=On%20%23Knowledge%20Graph%20and%20%23ArtificialIntelligence%20%3A%20how%20to%20combine%20data%20and%20models%20for%20better%20information.%20%23BigData%20%23DataScience&#038;related' target='_blank' rel="noopener noreferrer" >Share on X</a><br
/><hr
/><p>The post <a
href="https://francescolelli.info/big-data/on-knowledge-graph-and-artificial-intelligence/">On Knowledge Graph and Artificial Intelligence</a> appeared first on <a
href="https://francescolelli.info">Francesco Lelli</a>.</p> ]]></content:encoded> <wfw:commentRss>https://francescolelli.info/big-data/on-knowledge-graph-and-artificial-intelligence/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <post-id
xmlns="com-wordpress:feed-additions:1">2400</post-id> </item> <item><title>Natural Language Processing for Unstructured Text in Colloquial Conversations</title><link>https://francescolelli.info/thesis/natural-language-processing-for-unstructured-text-in-colloquial-conversations/</link> <comments>https://francescolelli.info/thesis/natural-language-processing-for-unstructured-text-in-colloquial-conversations/#respond</comments> <dc:creator><![CDATA[Francesco Lelli]]></dc:creator> <pubDate>Thu, 22 Jul 2021 13:38:40 +0000</pubDate> <category><![CDATA[Thesis]]></category> <category><![CDATA[natural language processing]]></category> <category><![CDATA[natural language processing data]]></category> <category><![CDATA[natural language processing interview]]></category> <category><![CDATA[nlp]]></category> <category><![CDATA[NPL topic modelling]]></category> <guid
isPermaLink="false">https://francescolelli.info/?p=1554</guid><description><![CDATA[<p>Title: Natural Language Processing for Unstructured Text in Colloquial Conversations&#160; (tentative) Mentor: Francesco Lelli Have you ever received an eMail from a friend proposing a vacation to Venice and 30 second later you stumble into an advertisement proposing cheap flights or hotels for that very same destination?&#160; Welcome to the world of Natural Language Processing [&#8230;]</p><p>The post <a
href="https://francescolelli.info/thesis/natural-language-processing-for-unstructured-text-in-colloquial-conversations/">Natural Language Processing for Unstructured Text in Colloquial Conversations</a> appeared first on <a
href="https://francescolelli.info">Francesco Lelli</a>.</p> ]]></description> <content:encoded><![CDATA[<p>Title: <strong>Natural Language Processing for Unstructured Text in Colloquial Conversations</strong>&nbsp; (tentative)</p><p><strong>Mentor</strong>: Francesco Lelli</p><p>Have you ever received an eMail from a friend proposing a vacation to Venice and 30 second later you stumble into an advertisement proposing cheap flights or hotels for that very same destination?&nbsp; Welcome to the world of Natural Language Processing (NLP).</p><p>This thesis will investigate the limits that machines have
in processing and understanding colloquial conversations. If you are curious to
know what can actually be automatized by Artificial Intelligence this thesis is
for you.</p><div
class="wp-block-image"><figure
class="alignright is-resized"><img
decoding="async" data-attachment-id="188" data-permalink="https://francescolelli.info/thesis/promotion-of-scientific-work-thesis-proposal/attachment/thesis-proposal/" data-orig-file="https://francescolelli.info/wp-content/uploads/2018/12/thesis-proposal.jpg" data-orig-size="500,350" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="thesis-proposal" data-image-description="&lt;p&gt;Natural Language Processing for Unstructured Text in Colloquial Conversations   &lt;/p&gt;
" data-image-caption="&lt;p&gt;Natural Language Processing for Unstructured Text in Colloquial Conversations   &lt;/p&gt;
" data-medium-file="https://francescolelli.info/wp-content/uploads/2018/12/thesis-proposal-300x210.jpg" data-large-file="https://francescolelli.info/wp-content/uploads/2018/12/thesis-proposal.jpg" src="https://francescolelli.info/wp-content/uploads/2018/12/thesis-proposal.jpg?8011c3&amp;8011c3" alt="Natural Language Processing for Unstructured Text in Colloquial Conversations   " class="wp-image-188" width="320" height="224" srcset="https://francescolelli.info/wp-content/uploads/2018/12/thesis-proposal.jpg 500w, https://francescolelli.info/wp-content/uploads/2018/12/thesis-proposal-300x210.jpg 300w" sizes="(max-width: 320px) 100vw, 320px" /><figcaption>Natural Language Processing for Unstructured Text in Colloquial Conversations </figcaption></figure></div><p>You will be able to touch with your own hands the technological
and business limits that this technology is facing in its adoption into our
everyday life as well as propose your personal ideas for improving the present
state of the art.</p><p>Moreover, you will have the opportunity to master techniques of NLP for parsing and extracting semantic information from any kind of text.</p><p>Your claims will be validated using popular NLP libraries (yes, you will have to write a few line of code!) using an existing dataset as benchmark (no, you do not have to collect data!).</p><p>&nbsp;If you are curious
and you what to know more about the topic, I recommend you the following:</p><ul
class="wp-block-list"><li>A generic article in Wikipedia about Natural
Language Processing: <a
href="https://en.wikipedia.org/wiki/Natural_language_processing">https://en.wikipedia.org/wiki/Natural_language_processing</a></li><li>Watch this YouTube video (knowledge of Python
will help you in understanding it)</li></ul><figure
class="wp-block-embed-youtube wp-block-embed is-type-video is-provider-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio"><div
class="wp-block-embed__wrapper"> <iframe
title="Natural Language Processing in Python" width="800" height="450" src="https://www.youtube.com/embed/xvqsFTUsOmc?feature=oembed&#038;enablejsapi=1&#038;origin=https://francescolelli.info" frameborder="0" allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share" referrerpolicy="strict-origin-when-cross-origin" allowfullscreen></iframe></div></figure><ul
class="wp-block-list"><li>A few keywords that you may want to use in google scholar: <em>natural language processing, NLP topic modelling, natural language processing interview,</em> <em>natural language processing data.</em></li></ul><p>This project does not involve an internship. Instead, it will try to have a high academic relevance and theoretical contribution and, based on the quality of your work the candidate may be able to publish the results in the proceedings and scientific journals. If this topic triggers your intellectual curiosity, let me know.</p><p> If you are interested in the topic, do not hesitate to <a
href="https://francescolelli.info/contacts/">contact us</a> for more information.</p><p>The post <a
href="https://francescolelli.info/thesis/natural-language-processing-for-unstructured-text-in-colloquial-conversations/">Natural Language Processing for Unstructured Text in Colloquial Conversations</a> appeared first on <a
href="https://francescolelli.info">Francesco Lelli</a>.</p> ]]></content:encoded> <wfw:commentRss>https://francescolelli.info/thesis/natural-language-processing-for-unstructured-text-in-colloquial-conversations/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <post-id
xmlns="com-wordpress:feed-additions:1">1554</post-id> </item> <item><title>Master Thesis Plus Talpa Internship Opportunities in Big Data and Artificial Intelligence</title><link>https://francescolelli.info/thesis/master-thesis-plus-talpa-internship-opportunities-in-big-data-and-artificial-intelligence/</link> <comments>https://francescolelli.info/thesis/master-thesis-plus-talpa-internship-opportunities-in-big-data-and-artificial-intelligence/#respond</comments> <dc:creator><![CDATA[Francesco Lelli]]></dc:creator> <pubDate>Fri, 15 May 2020 17:33:14 +0000</pubDate> <category><![CDATA[Thesis]]></category> <category><![CDATA[Artificial Intelligence]]></category> <category><![CDATA[Internship]]></category> <category><![CDATA[machine learning]]></category> <category><![CDATA[master]]></category> <category><![CDATA[nlp]]></category> <category><![CDATA[predictor]]></category> <category><![CDATA[talpa]]></category> <category><![CDATA[thesis proposal]]></category> <category><![CDATA[Web 2.0]]></category> <guid
isPermaLink="false">https://francescolelli.info/?p=1961</guid><description><![CDATA[<p>Master Thesis Plus Talpa Internship Opportunities in Big Data and Artificial Intelligence Mentor at TiU: Francesco Lelli&#160; Mentors at Talpa: Anca Dumitrache and/or Ricardo Fabian Guevara There are some internships opportunities available at Talpa. You will have the opportunity to develop your master&#8217;s thesis in collaboration with the AI division of one of the most [&#8230;]</p><p>The post <a
href="https://francescolelli.info/thesis/master-thesis-plus-talpa-internship-opportunities-in-big-data-and-artificial-intelligence/">Master Thesis Plus Talpa Internship Opportunities in Big Data and Artificial Intelligence</a> appeared first on <a
href="https://francescolelli.info">Francesco Lelli</a>.</p> ]]></description> <content:encoded><![CDATA[<p><strong>Master Thesis Plus Talpa Internship Opportunities in Big Data and Artificial Intelligence</strong></p><p><strong>Mentor at TiU</strong>: Francesco Lelli&nbsp;</p><p><strong>Mentors at Talpa</strong>: <a
href="http://ancad.ro/">Anca Dumitrache</a> and/or <a
href="https://www.linkedin.com/in/ricardo-fabian-guevara/">Ricardo Fabian Guevara</a></p><div
class="wp-block-image"><figure
class="alignright size-large is-resized"><img
loading="lazy" decoding="async" data-attachment-id="1962" data-permalink="https://francescolelli.info/thesis/master-thesis-plus-talpa-internship-opportunities-in-big-data-and-artificial-intelligence/attachment/talpa_logo/" data-orig-file="https://francescolelli.info/wp-content/uploads/2020/05/Talpa_logo.png" data-orig-size="520,1000" data-comments-opened="1" data-image-meta="{&quot;aperture&quot;:&quot;0&quot;,&quot;credit&quot;:&quot;&quot;,&quot;camera&quot;:&quot;&quot;,&quot;caption&quot;:&quot;&quot;,&quot;created_timestamp&quot;:&quot;0&quot;,&quot;copyright&quot;:&quot;&quot;,&quot;focal_length&quot;:&quot;0&quot;,&quot;iso&quot;:&quot;0&quot;,&quot;shutter_speed&quot;:&quot;0&quot;,&quot;title&quot;:&quot;&quot;,&quot;orientation&quot;:&quot;0&quot;}" data-image-title="Talpa_logo" data-image-description="&lt;p&gt;Talpa Internship Opportunities &lt;/p&gt;
" data-image-caption="&lt;p&gt;Talpa Internship Opportunities &lt;/p&gt;
" data-medium-file="https://francescolelli.info/wp-content/uploads/2020/05/Talpa_logo-156x300.png" data-large-file="https://francescolelli.info/wp-content/uploads/2020/05/Talpa_logo.png" src="https://francescolelli.info/wp-content/uploads/2020/05/Talpa_logo.png?8011c3&amp;8011c3" alt="Talpa Internship" class="wp-image-1962" width="153" height="294" srcset="https://francescolelli.info/wp-content/uploads/2020/05/Talpa_logo.png 520w, https://francescolelli.info/wp-content/uploads/2020/05/Talpa_logo-156x300.png 156w" sizes="(max-width: 153px) 100vw, 153px" /><figcaption>Talpa Internship Opportunities</figcaption></figure></div><p>There are some internships opportunities available at Talpa. You will have the opportunity to develop your master&#8217;s thesis in collaboration with the AI division of one of the most innovative media companies in the Netherlands .</p><p>If you have a go-get attitude with the desire to expand your knowledge and expertise in the area of big data and artificial intelligence, this is probably the internship that you are looking for.</p><p>Topics for this thesis include Natural Language Processing (NLP), Machine Learning and Predictors. At the same time you are encouraged to propose your own idea as well.</p><h2 class="wp-block-heading">Knowledge and Skills:</h2><p>Programming, preferably Python, and/or statistical skills.</p><h2 class="wp-block-heading">Interested in Joining Talpa?</h2><p>Here you can find a short presentation of the company:</p><iframe
src="https://www.linkedin.com/embed/feed/update/urn:li:ugcPost:6532146857345368064" height="527" width="504" frameborder="0" allowfullscreen="" title="Embedded post"></iframe><h2 class="wp-block-heading">Where to Read More:</h2><p>The following ideas are possible internship opportunities and related topic for your thesis. You may want to use them for forming an idea of the kind of jobs that you will be doing at Talpa as well as for <a
href="https://francescolelli.info/thesis/how-to-write-a-thesis-proposal-or-a-research-proposal-a-few-tips/">developing your research proposal (here you can find a few tips for that)</a>.</p><h3 class="wp-block-heading"><strong>Swimlane for Trending on Social Media</strong></h3><p>Both KIJK (video streaming platform) and JUKE (audio streaming platform) present their content separated into swimlanes on the front page. For instance, one swimlane contains a list of TV shows that were popular in the previous days. We would like to create a new swimlane that contains items that were trending on social media. Taking Twitter as a data source, the project will go through the following steps:</p><ul
class="wp-block-list"><li>(1) extract tweets about popular TV shows and/or radio shows,</li><li>(2) perform entity linking to match them to the shows in our database,</li><li>(3) aggregate the results to get the most popular shows in one swimlane.</li></ul><p><em>Project:</em> KIJK and/or JUKE</p><h3 class="wp-block-heading"><strong>Automatic Teaser Tweet Creation</strong></h3><p>Starting with a (textual) description of a TV show episode or radio program, we would like to generate teaser tweets about the show that are meant to generate anticipation on social media. The underlying task would be a summarization problem, where the program description is mapped to a short tweet about it. The tweet should contain relevant information about the program, but not reveal any spoilers. Either an extractive (entity + relation extraction) or abstractive method could be applied.</p><p><em>Source:</em><a
href="https://urldefense.com/v3/__https:/www.aclweb.org/anthology/N19-1398/__;!!PfSLnZU!hDuCqMfi3gtTFLf-YHrfLf8ieTPLJSQPg0-UuDsiwZe-KZp-HLtGuFrIYHwFEBhe_zZcgD_I$"> https://www.aclweb.org/anthology/N19-1398/</a></p><p><em>Project:</em> KIJK and/or JUKE</p><h3 class="wp-block-heading"><strong>Exploring Pair-wise Learning-to-Rank</strong></h3><p>Talpa current recommender system uses ALS, a point-wise learning-to-rank approach, where the learning objective is based on modeling the score of a given item (i.e. similarly to how regression works). Alternative methods of doing learning-to-rank are pair-wise (learning objective is to model the ranking of a pair of items relative to each other) and list-wise learning to rank (learning objective is calculated over the entire list of items). The project goal is to investigate different learning objectives and find out:</p><ul
class="wp-block-list"><li>(1) how they perform relatively to the point-wise method,</li><li>(2) if there are subsets of data where this method works better/worse.</li></ul><p><em>Source:</em><a
href="https://urldefense.com/v3/__https:/medium.com/@nikhilbd/pointwise-vs-pairwise-vs-listwise-learning-to-rank-80a8fe8fadfd__;!!PfSLnZU!hDuCqMfi3gtTFLf-YHrfLf8ieTPLJSQPg0-UuDsiwZe-KZp-HLtGuFrIYHwFEBhe_zBcYqCK$">https://medium.com/@nikhilbd/pointwise-vs-pairwise-vs-listwise-learning-to-rank-80a8fe8fadfd</a></p><p><em>Project:</em> KIJK and/or JUKE</p><h3 class="wp-block-heading"><strong>Automatic Playlist Generation</strong></h3><p>JUKE music player features a lot of <em>non-stop music</em> playlists that are manually created by an editor, usually selecting music from a given genre (e.g. hard rock non-stop radio). We would like to see whether these playlists can be generated by AI and what the quality is. This can be approached as a song clustering problem, where the feature space could contain the genre, artist, as well as other audio features.</p><p><em>Project</em>: JUKE</p><h3 class="wp-block-heading"><strong>Google AdWords for Video</strong></h3><p>Advertisers can buy Google search keywords to show their ad in conjunction with them. We would like to see whether it is possible to do something similar within videos, too. In this way, contextual advertising in different medias becomes feasible. The steps involved in this project are:</p><ul
class="wp-block-list"><li>(1) generate textual metadata of video (either based on transcript, or other features of the video; there might be some existing video metadata as well),</li><li>(2) match video textual metadata with ad keywords.</li></ul><h3 class="wp-block-heading"><strong>Exploring Seasonal Trends as RecSys Features</strong></h3><p>Our domain experts know that both video and audio streaming trends are highly influenced by seasonality (e.g. Sky Radio becomes very popular over Christmas). The goal of this project is to:</p><ul
class="wp-block-list"><li>(1)  identify these trends by studying user listening data, then</li><li>(2) incorporating these trends into our prediction model (e.g. recommend Sky Radio to fans of Christmas music, but only over the Christmas holidays).</li></ul><p><em>Challenge:</em> a lot of data may be needed to accomplish this.</p><p><em>Project:</em> KIJK and/or JUKE</p><h3 class="wp-block-heading"><strong>Radio Station Embeddings</strong></h3><p><em>Goal</em>: produce embedding with various sizes similar to <a
href="https://urldefense.com/v3/__https:/nlp.stanford.edu/projects/glove/__;!!PfSLnZU!hDuCqMfi3gtTFLf-YHrfLf8ieTPLJSQPg0-UuDsiwZe-KZp-HLtGuFrIYHwFEBhe_4KnEqfT$">Glove</a>&nbsp;but for radio stations that reflect radio station similarity with euclidean distance.</p><p><em>Value</em>: many of Talpa challenges, like radio to radio similarity, could be tackled with this approach. Our RecSys can also use the radio embedding as a feature to determine if a radio channel is a good match for a user. This could also greatly alleviate the cold start problem when we introduce new radio stations.</p><p><em>Approach</em>: this is an unexplored problem but there has been previous research and practical work done on <a
href="https://urldefense.com/v3/__https:/benanne.github.io/2014/08/05/spotify-cnns.html__;!!PfSLnZU!hDuCqMfi3gtTFLf-YHrfLf8ieTPLJSQPg0-UuDsiwZe-KZp-HLtGuFrIYHwFEBhe_ywwSc1a$">audio embeddings</a>. In the simplest form, a radio station embedding could be just a bag of features like genre frequencies, languages and origins of artists, but also properties from the target audience like age range. Going beyond that, it could use an average of the embeddings of representative songs on it in a given past period. Another valuable asset of Talpa is its understanding of usage behavior, so radio station features could also be calculated based on the type of person who listens to it, the frequency of listening and other seasonal behavior.</p><p>At the same time, there is a temporal aspect to take into consideration when analyzing user preference of radio stations.  Therefore, we cannot expect the embeddings to remain the same forever. Consequently a refresh period is to be expected (even word embeddings need to account for word semantic drifting, only their refresh window is larger).</p><p><em>Project</em>: Juke</p><h3 class="wp-block-heading"><strong>Podcast Embeddings for JUKE</strong></h3><p>Similar to the radio embeddings project, Talpa is interested in creating embeddings for the podcasts available on JUKE. Podcast recommendations suffer from cold start problem even more than radio, since there is usually a high volume of items that are published continuously. The features that can be used for podcast embeddings are also slightly different than typical audio embedding features based on e.g. musical genre.</p><p> <em>Project</em>: Juke</p><h2 class="wp-block-heading">Interested in a Talpa internship? Check also the Following:</h2><div
class="wp-block-file"><a
href="https://francescolelli.info/wp-content/uploads/2020/05/TiU-Talpa.pdf?8011c3&amp;8011c3">A presentation from Francesco, Anca and Fabian about TiU and Talpa Internship Opportunities </a><a
href="https://francescolelli.info/wp-content/uploads/2020/05/TiU-Talpa.pdf?8011c3&amp;8011c3" class="wp-block-file__button" download>Download the Presentation</a></div><p><a
href="https://videocollege.uvt.nl/Mediasite/Play/767355b24a5e4307811f59f802f1c04c1d"><strong>Webinar </strong>about Talpa Internship Opportunities  (Available for Tilburg Student Only)</a></p><h2 class="wp-block-heading">Interested in Applying?</h2><p>Send an email to <a
href="https://francescolelli.info/contacts/">Francesco Lelli with CV, Short motivation letter, project(s) that interest you and (optional) draft research proposal. </a>In the case you would like to propose a particular project please put extra attention to your research proposal.</p><p><em>A selected list </em>of<em> candidates will be interviewe</em>d<em> by TALPA that has the ultimate saying on accepting you for an internship.</em><strong><em> Succes, Good luck, In Bocca al Lupo! </em></strong></p><p>The post <a
href="https://francescolelli.info/thesis/master-thesis-plus-talpa-internship-opportunities-in-big-data-and-artificial-intelligence/">Master Thesis Plus Talpa Internship Opportunities in Big Data and Artificial Intelligence</a> appeared first on <a
href="https://francescolelli.info">Francesco Lelli</a>.</p> ]]></content:encoded> <wfw:commentRss>https://francescolelli.info/thesis/master-thesis-plus-talpa-internship-opportunities-in-big-data-and-artificial-intelligence/feed/</wfw:commentRss> <slash:comments>0</slash:comments> <post-id
xmlns="com-wordpress:feed-additions:1">1961</post-id> </item> </channel> </rss>