Language technology in Africa: Prospects Arvi Hurskainen University of Helsinki.

Slides:



Advertisements
Similar presentations
© 2000 XTRA Translation Services Is MT technology available today ready to replace human translators?
Advertisements

Making Europe easy for citizens and businesses Practical tools for local and regional authorities OPEN DAYS for Municipalities and Regions Brussels, 6.
LABELING TURKISH NEWS STORIES WITH CRF Prof. Dr. Eşref Adalı ISTANBUL TECHNICAL UNIVERSITY COMPUTER ENGINEERING 1.
Configuration management
CS Morphological Parsing CS Parsing Taking a surface input and analyzing its components and underlying structure Morphological parsing:
1 Egyptian Ministry of Communications and Information Technology Research and Development Centers of Excellence Initiative Data Mining and Computer Modeling.
A Syntactic Translation Memory Vincent Vandeghinste Centre for Computational Linguistics K.U.Leuven
CALTS, UNIV. OF HYDERABAD. SAP, LANGUAGE TECHNOLOGY CALTS has been in NLP for over a decade. It has participated in the following major projects: 1. NLP-TTP,
1 Developing Statistic-based and Rule-based Grammar Checkers for Chinese ESL Learners Howard Chen Department of English National Taiwan Normal University.
How do we work in a virtual multilingual classroom? A virtual multilingual classroom with Moodle and Apertium Cultural and Linguistic Practices in the.
Assessment and error correction. Reasons for assessment  a teacher is accountable for children’s progress to the children themselves, to the parents,
What is a corpus?* A corpus is defined in terms of  form  purpose The word corpus is used to describe a collection of examples of language collected.
Resources Primary resources – Lexicons, structured vocabularies – Grammars (in widest sense) – Corpora – Treebanks Secondary resources – Designed for a.
Course Summary LING 575 Fei Xia 03/06/07. Outline Introduction to MT: 1 Major approaches –SMT: 3 –Transfer-based MT: 2 –Hybrid systems: 2 Other topics.
Machine Learning in Natural Language Processing Noriko Tomuro November 16, 2006.
Semi-Automatic Learning of Transfer Rules for Machine Translation of Low-Density Languages Katharina Probst April 5, 2002.
The LC-STAR project (IST ) Objectives: Track I (duration 2 years) Specification and creation of large word lists and lexica suited for flexible.
1.3 Executing Programs. How is Computer Code Transformed into an Executable? Interpreters Compilers Hybrid systems.
Knowledge Process Outsourcing1 Turning Information into Knowledge... for YOU The Gyaan Team.
Phonetics, Phonology, Morphology and Syntax
Statistical Natural Language Processing. What is NLP?  Natural Language Processing (NLP), or Computational Linguistics, is concerned with theoretical.
European Language Learning for Life-Long Learning: Issues in Cyprus Victoria Kalogerou Cyprus Academic Research Institute 66, Metochiou str. Nicosia, Cyprus.
Machine Translation Dr. Radhika Mamidi. What is Machine Translation? A sub-field of computational linguistics It investigates the use of computer software.
Computational Methods to Vocalize Arabic Texts H. Safadi*, O. Al Dakkak** & N. Ghneim**
Lemmatization Tagging LELA /20 Lemmatization Basic form of annotation involving identification of underlying lemmas (lexemes) of the words in.
CIG Conference Norwich September 2006 AUTINDEX 1 AUTINDEX: Automatic Indexing and Classification of Texts Catherine Pease & Paul Schmidt IAI, Saarbrücken.
Intellectual Property Rights (IPR) Helpdesk A presentation by Daniela Nolte.
Computational Investigation of Palestinian Arabic Dialects
Recent Activities of Speech Corpora and Assessment in Korea Yong-Ju Lee Wonkwang University Korea.
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
Biosafety Clearing House Training Workshop December, 2008 Bangkok.
1 Computational Linguistics Ling 200 Spring 2006.
Roadmap for Language Resources and Evaluation in a Multilingual Environment Minority Languages in the African Context Justus Roux Centre for Language and.
The CoNLL-2013 Shared Task on Grammatical Error Correction Hwee Tou Ng, Yuanbin Wu, and Christian Hadiwinoto 1 Siew.
Teaching Productive Skills Which ones are they? Writing… and… Speaking They have similarities and Differences.
Natural Language Processing Guangyan Song. What is NLP  Natural Language processing (NLP) is a field of computer science and linguistics concerned with.
Introducing MorphoLogic to LIRICS Gábor Prószéky MorphoLogic Pázmány Péter Catholic University Faculty.
Combining terminology resources and statistical methods for entity recognition: an evaluation Angus Roberts, Robert Gaizauskas, Mark Hepple, Yikun Guo.
Péter Schönhofen – Ad Hoc Hungarian → English – CLEF Workshop 20 Sep 2007 Performing Cross-Language Retrieval with Wikipedia Participation report for Ad.
Presented by Madhuriya Kumar Dutta Trade and Investment Facilitation Department Mekong Institute, Thailand 16 May 2012.
Eurostat Expression language (EL) in Eurostat SDMX - TWG Luxembourg, 5 Jun 2013 Adam Wroński.
Using Short-Answer Format Questions for an English Grammar Tutoring System Conceptualization & Research Planning Jonggun Gim.
“ BIRD Project“ 1 Broadband Access, Innovation & Regional Development” Broadband Access, Innovation & Regional Development” Project Description Ulrich.
What you have learned and how you can use it : Grammars and Lexicons Parts I-III.
Computational linguistics A brief overview. Computational Linguistics might be considered as a synonym of automatic processing of natural language, since.
RESEARCH DESIGN & CORPUS COMPILATION. Corpus design is intrinsic and a fundamental part of the analysis. It is guided by the RQ and affects the results.
For Monday Read chapter 26 Last Homework –Chapter 23, exercise 7.
Auckland 2012Kilgarriff: NLP and Corpus Processing1 The contribution of NLP: corpus processing.
Catia Cucchiarini, Walter Daelemans and Helmer Strik Strengthening the Dutch Language and Speech Technology Infrastructure Catia Cucchiarini, Walter Daelemans.
Africa Programme on Gender Statistics Status of implementation United Economic Commission for Africa Meeting of Committee of Directors General November.
For Friday Finish chapter 23 Homework –Chapter 23, exercise 15.
Generality and Openness in Enabling Methodologies for Morphology and Text Processing Anssi Yli-Jyrä Department of General Linguistics, University of Helsinki.
1 STO A Lexical Database of Danish for Language Technology Applications Anna Braasch Center for Sprogteknologi Copenhagen SPINN Seminar, October 27, 2001.
Distant Course Master English English Language Course For Masters of Mathematical and Mechanical Faculty Saint Petersburg State University The Faculty.
Shallow Parsing for South Asian Languages -Himanshu Agrawal.
Introduction A field survey of Dutch language resources has been carried out within the framework of a project launched by the Dutch Language Union (Nederlandse.
For Monday Read chapter 26 Homework: –Chapter 23, exercises 8 and 9.
Multi-word Expressions and CG How should MWEs be described?
COURSE AND SYLLABUS DESIGN
A Simple English-to-Punjabi Translation System By : Shailendra Singh.
Big Data: Every Word Managing Data Data Mining TerminologyData Collection CrowdsourcingSecurity & Validation Universal Translation Monolingual Dictionaries.
Work Package 2 Eva Heckl 1/10/2014 Feasibility study on an internet-based e-platform for women entrepreneurs.
Tasneem Ghnaimat. Language Model An abstract representation of a (natural) language. An approximation to real language Assume we have a set of sentences,
Indian Community Languages Schools Parents and Teachers Conference July 2017.
Approaches to Machine Translation
Eurostat D2 – Regional Indicators and Geographical Information
Machine Learning in Natural Language Processing
Scanning the environment: The global perspective on the integration of non-traditional data sources, administrative data and geospatial information Sub-regional.
E-LEARNING ON EU - ADULT EDUCATION FOR TEACHERS
Approaches to Machine Translation
Presentation transcript:

Language technology in Africa: Prospects Arvi Hurskainen University of Helsinki

Why LT for African languages? LT is currently considered a necessary field of development in most languages. Why should African languages be neglected?

Current state Compared with other continents, LT in Africa takes its first steps.

Current state The latest issue of MultiLingual, a periodical with 15,000 subscribers, was supposed to concentrate on LT in Africa. The only article discussing genuine LT was the one describing Swahili Language Manager (SALAMA) Another article on Africa was written by a freelancer on public domain localization in South Africa. That was all for Africa.

In LT the gap between well- resourced and poorly resourced languages is bigger than in any other field. My impression is that even today half of global investments on LT goes to English.

African languages are triply handicapped: –Commercial sector not interested –Local governments poor – little or no public support –African languages have features that need different approaches than those used in main-stream LT

Language technology (LT) Labour-intensive –Trivial results quickly –Useful results require several man-years of work Although the development of LT is expensive, the results can be very rewarding

Language technology (LT) LT built on a modular basis can result in several kinds of applications –An additional application can make use of earlier modules and thus costs can be reduced Once developed, LT applications can be widely distributed with minimal cost

Language technology (LT) Experience of LT in other languages available –Wrong tracks can be avoided –Solutions applied in other languages can be tested in African languages

Language technology (LT) LT of African languages NOT mere application from other languages African languages have special features –Very rich morphology –Noun classes –Complex verb formation –Serial verbs –Non-concatenative processes –Reduplication –Inflecting idioms and other multi-word expressions –Tones Lexical Grammatical

Feasibility of LT in Africa Question: If African languages have several special features regarded as ’problems’, is it feasible to develop language technology for those languages? Answer: Some ‘problems’ can be turned into advantages

Rich morphology Requires efficient development environment to succeed, but Can be very useful in disambiguation (= choice of correct interpretation) and syntactic analysis.

Poor morphology vs. rich morphology Poor morphology (e.g. English) –Easy to analyze morphologically, but –Difficult to disambiguate and analyze syntactically and semantically Rich morphology (e.g. Bantu languages) –Difficult to analyze morphologically, but –Less difficult to disambiguate and analyze syntactically

LT applications Applications for end-users: –spelling correctors –hyphenators –grammar checkers –thesauri –electronic dictionaries –MT applications –multilingual speech applications

LT applications Applications for developers: –dictionary compilers –dictionary evaluators –MT development environments –information retrieval and data mining

Machine Translation (MT) Text-to-text MT –Official texts (government, AU, UN, SADDEC, business, manuals, teaching) –News texts –Communication through in international organizations Speech-to-speech MT –Simultaneous interpretation –Multilingual phone calls

Phases of speech-to-speech MT 1.Speech recognition Transforming speech signal to text 2.Tokenization of text Identifying ‘words’, punctuation marks, diacritics etc. 3.Morphological analysis Analyzing each morphological unit and providing it with codes (tags) 4.Morphological disambiguation Determining correct interpretation

Phases of speech-to-speech MT 5. Syntactic mapping Providing words with syntactic tags 6.Semantic disambiguation Choosing the correct semantic meaning 7.Multi-word units Isolating multi-word expressions and giving correct interpretation Idioms Proverbs Adjectival expressions Compound nouns Serial verb constructions

Phases of speech-to-speech MT 8.Managing word order Re-ordering word sequences to meet the rules of the target language Inclusion and exclusion of pronouns and articles 9.Producing surface forms of target language 10.Clean text in target language 11.Text-to-speech conversion

1. Tokenization *mtu aliyepata taarifa alipiga simu, kukaa na kungoja

2. Morphological analysis *mtu "mtu" N CAP 1/2-SG { the } { man } aliyepata "pata" V 1/2-SG3-SP VFIN { he/she } PAST 1/2-SG-REL { who } z [pata] { get } SVO taarifa "taarifa" N 9/10-SG { the } { report } AR "taarifa" N 9/10-PL { the } { report } AR alipiga "piga" V 1/2-SG3-SP VFIN { he/she } PAST z [piga] { hit } SVO ACT "piga" V 1/2-SG3-SP VFIN { he/she } PR:a 5/6-SG-OBJ OBJ { it } z [piga] { hit } SVO ACT simu "simu" N 9/10-SG { the } { telephone } "simu" N 9/10-SG { the } { type of sardine or sprat } AN "simu" N 9/10-PL { the } { telephone } "simu" N 9/10-PL { the } { type of sardine or sprat } AN, "," COMMA {, } kukaa "kaa" V INF { to } z [kaa] { sit } SV SVO "kaa" V INF NO-TO z [kaa] { sit } SV SVO na "na" CC { and } "na" AG-PART { by } "na" PREP { with } "na" NA-POSS { of } "na" ADV NOART { past } kungoja "ngoja" V INF { to } z [ngoja] { wait } SV "ngoja" V INF NO-TO z [ngoja] { wait } SV

3. Disambiguation, isolating MWE *mtu "mtu" N 1/2-SG { the } { man aliyepata "pata" V 1/2-SG3-SP VFIN { he/she } PAST 1/2-SG-REL { who } z [pata] { get } taarifa "taarifa" N 9/10-SG { the } { report } alipiga "piga" V 1/2-SG3-SP VFIN { he/she } PAST z SVO ACT simu "simu" <IDIOM { call }, "," COMMA {, } kukaa "kaa" V INF { to } z [kaa] { sit } SV "kaa" V INF NO-TO z [kaa] { sit } SV na "na" CC { and kungoja "ngoja" V INF { to } z [ngoja] { wait } SV "ngoja" V INF NO-TO z [ngoja] { wait } SV

4. Isolating MWE ( N 1/2-SG { the } { man ) ( V 1/2-SG3- SP VFIN { he/she } PAST 1/2-SG-REL { who } z { get } ) ( N 9/10-SG { the } { report ) ( V 1/2-SG3-SP VFIN { he/she } PAST z SVO ACT <IDIOM { call } ) ( COMMA {, } ) ( V INF { to } z { sit } SV ) ( CC { and ) ( V INF { to } z { wait } )

5. Word-per-line format ( N 1/2-SG { the } { man ) ( V 1/2-SG3-SP VFIN { he/she } PAST 1/2-SG-REL { who } z { get } ) ( N 9/10-SG { the } { report ) ( V 1/2-SG3-SP VFIN { he/she } PAST z SVO ACT <IDIOM { call } ) ( COMMA {, } ) ( V INF { to } z { sit } SV ) ( CC { and ) ( V INF { to } z { wait } )

6. Copying info on serial verbs ( N 1/2-SG { the } { man ) ( V 1/2-SG3-SP VFIN PAST 1/2-SG-REL { who } z { get } ) ( N 9/10-SG { the } { report ) ( V 1/2-SG3-SP VFIN PAST z SVO ACT <IDIOM { call } ) ( COMMA {, } ) ( V 1/2-SG3-SP VFIN PAST z { sit } SV ) ( CC { and ) ( V 1/2-SG3-SP VFIN PAST z { wait } SV )

7. Construct word order ( N 1/2-SG { the } { man ) ( V 1/2-SG3-SP VFIN PAST 1/2-SG-REL { who } z { get } ) ( N 9/10-SG { the } { report ) ( V 1/2-SG3-SP VFIN PAST z SVO ACT <IDIOM { call } ) ( COMMA {, } ) ( V 1/2-SG3-SP VFIN PAST z { sit } SV ) ( CC { and ) ( V 1/2-SG3-SP VFIN PAST z { wait } SV )

8. Surface form in target language ( N 1/2-SG { the } { man ) ( V 1/2-SG3-SP VFIN PAST 1/2-SG-REL { who } z { :got } ) ( N 9/10-SG { the } { report ) ( V 1/2-SG3-SP VFIN PAST z SVO ACT <IDIOM { :called } ) ( COMMA {, } ) ( V 1/2-SG3-SP VFIN PAST z { :sat } SV ) ( CC { and ) ( V 1/2-SG3-SP VFIN PAST z { :waited } ) Translation: the man who got the report called, sat and waited

Organizing the work How should the work be organised on the continent of hundreds of languages? Prioritising languages –‘Big’ languages first due to their strategic importance –Some minor languages may have special political or scientific importance

Organizing the work Scientific infrastructure –Such as ELRA (European Language Resource Association) and –ELDA (European Language Resource Distribution Agency) Africa needs something similar An initiative was made in the LREC2006 conference in Genova to establish such an infrastructure

Organizing the work Networking extremely important –Geographical distances between actors are immense –Ensures efficient communication and distribution of ideas –Ensures that the best and tested approaches will become a standard in LT –Motivates in this tough work

Networking A Wikipedia type forum as an information and discussion centre for LT in Africa KitWiki/Community/AfricanActiviti es

KitWiki/Community/AfricanActivities Organizations, networks and activities related to LT for African languages Key Areas LT Policy LT Resources Helsinki Corpus of Swahili Corpus Of SwahiliCorpus Of Swahili LT Research and Development SALAMA - Swahili Language Manager SALAMASALAMA Nordic Journal of African Studies NJASNJAS LT Training and Education LT Legislation LT Business Activities Other Activities This topic: KitWiki/Community > WebHome > AfricanActivitiesWebHome AfricanActivities History: r Jul :03 - ArviHurskainenArviHurskainen

EDULINK initiative EU has started in 2006 to support networking between higher education institutions EDULINK-ACP-EU Cooperation Programme in Higher Education

EDULINK initiative EDULINK is the first ACP-EU Cooperation Programme in Higher Education EDULINK is financed by the European Commission under the 9th EDF and is managed by the ACP Secretariat.

EDULINK initiative EDULINK promotes networking of HEIs in ACP States and the eligible EU Member States through funding of joint projects.

EDULINK initiative: Language technology for African languages Consortium of five universities –Dar-es-Salaam –Nairobi –Ghana –Hawassa (Ethiopia) –Helsinki Associates –UNISA, Stellenbosch, SA –Trondheim

EDULINK initiative: Language technology for African languages Aims –Training in LT Workshops Training courses Summer School in LT Evaluation –Developing new LT Language corpora Morphological parsers Speech technology MT (further development of SALAMA)

Development environments Environments with property rights –Can be obtained through licensing for development purposes –Can also be available with nominal price, e.g. xfst package of Xerox –Cannot be included into the product without a separate agreement with the property owner

Development environments Open domain environments –Free for development –Free for inclusion into a product

Availability of development environments In morphology –xfst package of Xerox using finite state methods is most popular –Free for development but not free for inclusion into a product

Availability of development environments In disambiguation and syntactic mapping –CG-2 and Functional Dependency Grammar (FDG) of Connexor Only through licensing Not free for inclusion into a product

Availability of development environments In disambiguation and syntactic mapping –CG-3 is an open source product Free for developing Free for inclusion into a product ammar.html

Developing open source technology Efforts to move SALAMA to open domain

Two implementations of SALAMA Comparison of two methods for morphological analysis –Analysis using finite state method (PR) and –Analysis using two-phase method (OS)

Two implementations of SALAMA Finite state method –Good Very fast, w/s in SWATWOL Facilitates description on more than one level –Two-level description most common

Two implementations of SALAMA Finite state method –Good The use of two-level rules simplifies the structure of the dictionary The whole morphology can be described in one phase Can be used for simulating linguistic processes (good for research purposes)

Two implementations of SALAMA Finite state method –Bad Difficult in handling non- concatenative processes (does not ‘see behind’) Writing a reliable rule system is difficult In constructing the lexicon, the influence of the rules must be anticipated

Two implementations of SALAMA Finite state method –Bad Because the lexicon is a tree- structure, the whole language should be described with one single lexicon Difficulties in compiling very large lexicons No open source platform available

Two implementations of SALAMA Two-phase method - description –In the first phase, the word is described using pattern matching rules Produces meta-tags with two parts Example: –unanifundisha “fundisha” [funda] V uSP naTAM niOBJ ishaVE –uSP »u = string in the word »SP = tag meaning subject prefix

Two implementations of SALAMA Two-phase method –In the second phase, meta-tags are rewritten as final tags –uSP > –1/2-SG2-SP VFIN { you } –3/4-SG-SP VFIN { it } –11-SG-SP VFIN { it }

Two implementations of SALAMA Result after the first phase: unanifundisha “fundisha” [funda] V uSP naTAM niOBJ ishaVE Result after the second phase: unanifundisha "fundisha" [funda] V 1/2-SG2-SP VFIN { you } PR:na 1/2-SG1-OBJ { me } { teach } CAUS "fundisha" [funda] V 3/4-SG-SP VFIN { it } PR:na 1/2-SG1-OBJ { me } { teach } CAUS "fundisha" [funda] V 11-SG-SP VFIN { it } PR:na 1/2-SG1-OBJ { me } { teach } CAUS

Two implementations of SALAMA Two-phase method –Good No specific development platform needed Task divided into two phases – makes the description of each phase more manageable No compilation problems, because the system is composed of a number of separate rules, each performing a specific task

Two implementations of SALAMA Two-phase method –Good Optimal order of readings can be controlled – helps in disambiguation, No ownership restrictions The product free for distribution

Two implementations of SALAMA Two-phase method –Bad Requires fairly good programming skills Because two-level rules cannot be used for simplifying the lexicon, the lexicon becomes complex The absence of state transition (found in fst methods) increases the need for copying word stems in complex word structures, e.g. verbs in Bantu languages

Two implementations of SALAMA The complexity of the lexicon can be reduced by allowing some overproduction, which will be removed afterwards with rules that check ungrammatical tag combinations

Two implementations of SALAMA Example: reciprocal and passive extensions block the object prefix –wananifundishana (ungrammatical) “fundishana” [funda] V waSP naTAM niOBJ ishanaVE In post-processing, the string will be removed with the rule that states that niOBJ and anaVE cannot co-occur

Two implementations of SALAMA Speed in morphological analysis –Finite state method: 4500 w/s –Two-phase method: 500 w/s Speed in machine translation –Finite state method: 650 w/s –Two-phase method: 350 w/s

Two implementations of SALAMA CG-3 in disambiguation and syntactic mapping is an open domain product

Two implementations of SALAMA Rules for re-ordering the sentence structure in the target language can be written with any suitable programming language

Two implementations of SALAMA Rules for producing the surface form in the target language can be written with any suitable programming language

Where? The main responsibility for developing LT for African languages should be in those countries where the languages are mostly used –Work out and implement a plan –Provide resources (human and capital) –Make use of the know-how available globally –Networking

Language resources Text corpora –Private collections of texts by researchers –Text and speech corpora of official languages of South Africa –Helsinki Corpus of Swahili (12 m) globally available Texts corrected and edited Morphologically annotated –Gikuyu annotated corpus (de Pauw et al)

Language resources Manuscripts –SOAS (School of Oriental and African Studies, London) holds a large collection of Swahili manuscripts –Background info in the Web, but not the manuscripts themselves

Language resources Corpus compilation in cooperation with publishing houses – so far very little used More extensive use of the material available in the Web

Dictionaries Internet Swahili Dictionary (Yale University) –Free and widely used (1 mil web visits monthly) –Compiled on voluntary basis NOTE! On Sep The Internet Living Swahili Dictionary has been taken offline – at least temporarily

Dictionaries TUKI (Taasisi ya Uchunguzi wa Kiswahili) has released a CD version of the –Swahili - English and –English – Swahili dictionaries –Can be edited and used in developing language tools

Grammars Electronic grammars missing SALAMA (Swahili Language Manager) contains a comprehensive grammar of Swahili SALAMA-DC has a potential of compiling extensive dictionaries with translated use examples

Tools Spell checkers based on word lists available for a number of languages –Kilinux (Kiswahili Linux) project Spell checkers based on linguistic analysis –Orthographix 2 for Swahili (Lingsoft) –Swahili speller integrated to MS Office 2007

Tools SALAMA - A comprehensive environment for developing various kinds of tools Spell checking Information retrieval Vocabulary compilation Concordance compilation Dictionary compilation Machine translation So far based on the language in text form

Projects in progress Localization to Swahili: –Windows 2000 and Windows XP (2006) –MS Office 2003 (2005) Open Swahili Localization Project (KILINUX) –Linux to Swahili –OpenOffice to Swahili, including a Swahili spell checker Ubuntu (a basic version of Linux) to Swahili and many other languages

Technology projects SALAMA –Based on linguistic knowledge –Maximal amount of linguistic information expressed overtly and systematically –Statistical probabilities used in semantic disambiguation –Developing environment rather than a task-specific tool –Modular structure –Extensible

Technology projects SALAMA –Current status: machine translation from Swahili to English Dictionary compilation –Future plans: machine translation from English to Swahili Integration to speech-to-speech applications

Open source platforms Need: open source platforms –Initiatives do exist for developing open source platforms for morphological analysis

Government support in Africa Detailed plans on how to proceed and how to finance the work still missing South Africa better organized than other areas Initiatives for networking –Networking the Development of Language Resources for African Languages (LREC 2006, Genova) –EDULINK Initiative (EU)

Summary Atmosphere positive Towards open source solutions Special features of African languages to be taken into account Systems rather than ad hoc solutions for individual problems Networking extremely important