Inter-partition communication in multi-core

系统 1811 0

A multi-core processor includes logical partitions that have respective processor cores, memory areas, and Ethernet controllers. At least one of the Ethernet controllers is disabled for external communication and is assigned as an inter-partition Ethernet controller for inter-partition communication. The inter-partition Ethernet controller is configured in loopback mode. A transmitting partition addresses a message through a send buffer in a private memory area to the inter-partition Ethernet controller assigned to a receiving partition. The receiving inter-partition Ethernet controller copies the received message to a receive buffer in the receiving partition's memory area. The receive Ethernet controller returns the received message to the sending partition and the sending partition resumes control of the memory space of the send buffer, or alternatively, the receive Ethernet controller frees the memory space of the send buffer to the private memory of the sending partition.

BACKGROUND OF THE INVENTION

The present invention is directed to multi-core processors and, more particularly, to inter-partition communication in a multi-core processor.

A multi-core processor is a single computing component with two or more independent processor cores, which can run separate instructions in parallel, increasing overall speed. The cores may be included in a single integrated circuit (IC) or in more than one IC but in a single package. The different processor cores may run codes in the same operating system (OS) and may be scheduled to run code in parallel (symmetrical multi-processing—'SMP'), sharing common memory, provided that each task in the system is not in execution on two or more cores at the same time. SMP systems can move tasks between cores to balance the workload efficiently. Alternatively, different cores may be restricted as concerns sharing specific memory and input/output (I/O) ports and may run different code in the same OS or may run different OSs (asymmetrical multi-processing—'AMP'). A core may be dedicated to a specific OS or may be capable of working in more than one OS or may even run without an OS.

Multi-core processors may be used in many applications such as general-purpose embedded computing systems, embedded and network communications including routers, switches, media gateways, base station and radio network controllers, digital signal processing (DSP), and graphics and video processing, for example. Multi-core processors typically contain many Ethernet controllers, which are level 2 (L2) devices in the Open Systems Interconnection (OSI) model. These Ethernet controllers are usually configurable to be connected to any of the Physical layer (PHY) devices present in the multi-core processors. It is common to have more Ethernet controllers than the number of PHY devices.

A multi-core processor may include two or more logical partitions, usually each hosting a separate instance of an OS. Logical partitioning divides hardware resources so that specific cores, memory areas and I/O ports are allocated to the different partitions. The interaction between the partitions and the applications running may be managed by a hypervisor. A hypervisor organizes a virtual operating platform and manages the execution of multiple "guest" OSs running in parallel on the processor. Several guest OSs may share the virtualized hardware resources.

Communication is typically necessary between partitions, referred to as inter-partition communication. Inter-partition communication may take the form of messages or calls and may involve exchange of data and/or exchange of control signaling. Inter-partition communication may be implemented through memory area shared between the sending and receiving partitions. However, memory sharing reduces isolation of the partitions and increases risks to security, especially if the inter-partition communication opens up direct private memory access between the partitions. Also there is a risk of starvation where a partition(s) over allocates from the shared memory or loses or never frees up previously allocated memory. Sharing of memory makes system recovery complex in case of failure of partition(s) in the system. Such risks can be managed if a hypervisor is provided and mediates every inter-partition communication, but making hypervisor calls imposes overhead and can make communication slow. Thus, it would be advantageous to have a method for inter-partition communication that does not rely on shared memory.

BRIEF DESCRIPTION OF THE DRAWINGS

The present invention is illustrated by way of example and is not limited by embodiments thereof shown in the accompanying figures, in which like references indicate similar elements. Elements in the figures are illustrated for simplicity and clarity and have not necessarily been drawn to scale.

FIG. 1 is a schematic block diagram of a known multi-core processor showing three logical partitions and a hypervisor;

FIG. 2 is a schematic block diagram of the multi-core processor of FIG. 1showing memory cache management and on-chip and peripheral communication connections and management;

FIG. 3 is a schematic diagram of inter-partition communication in part of a multi-core processor of the kind shown in FIGS. 1 and 2 in accordance with one embodiment of the invention, given by way of example;

FIG. 4 is a schematic diagram of inter-partition communication in part of a multi-core processor of the kind shown in FIGS. 1 and 2 in accordance with another embodiment of the invention, given by way of example; and

FIG. 5 is a flow chart of a method of operating a multi-core processor such as that shown in FIG. 3 or  4  in accordance with one embodiment of the invention, given by way of example.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

FIG. 1 illustrates a known multi-core processor  100  with logical partitions  102 , 104  and  106  and a hypervisor  108  managing execution of instruction code by system hardware  110 . The multi-core processor  100  is shown with three logical partitions but it will be appreciated that a different number of partitions may be provided. The logical partitions  102 104  and  106  have respective processor cores such as  112 114 116  and  118 , private memory areas such as  120 122 and  124 , and input/output ('I/O') ports such as  126 128 130  and  132 . The system hardware  110  also includes memory area  134  shared between the logical partitions  102 104  and  106 , a common I/O port  136 , shared memory cache  138 , I/O memory management unit (MMU)  140  and an interrupt controller  142 .

Inter-partition communication in multi-core processor

The multi-core processor  100  may run application codes in the same operating system (OS) and may be scheduled to run a code in parallel, symmetrical multi-processing mode (SMP). In this example, the multi-core processor  100  is shown with the different logical partitions  102 104  and  106  running application codes in different guest OS's in asymmetrical multi-processing mode (AMP), with the logical partition  102  executing Linux® OS, the logical partition  104  executing a third party real time OS (RTOS) and the logical partition  106  executing a lightweight executive (LWE) OS. The hypervisor  108  presents to the guest operating systems a virtual operating platform and manages the execution of the guest operating systems. The LWE OS provides run-to-completion data plane processing, so that processes do not pre-empt each other and each process must run to completion before other processes get a chance to run.

FIG. 2 illustrates in more detail hardware  200  and associated management layers in the multi-core processor  100 . The hardware  200  includes a set  202  of cores such as  112  to  118  with associated private memory caches. A Corenet™ coherency fabric  204  manages coherency of the memory caches and provides on-chip and peripheral communication connections and management that supports concurrent traffic and eliminates single-point bottlenecks for non-competing resources. The Corenet™ coherency fabric  204  avoids contention and latency issues associated with scaling shared bus/shared memory architectures. The management layers for the hardware  200  include accelerator, encryption and power management modules  206 , a buffer manager  208 , a queue manager  209 , common memory caches  210 , such as the shared memory  134 , memory controllers  212  and local bus controllers and interrupt control modules  214 .

Inter-partition communication in multi-core processor

The hardware  200  also includes two I/O modules which include respective frame managers  216  and  218 , respective 10 Gb/s Ethernet controllers  220  and  222 , and respective sets  224  and  226  of four 1 Gb/s Ethernet controllers. In addition the hardware  200  includes an on-chip network  228  with a peripheral component interconnect (PCI) interface  230 , a message I/O unit  232 , a serial I/O unit  234  and a direct memory access (DMA) unit  236 . A debug I/O unit  238  is provided for development and test work. All the I/O modules connect with a Serializer/Deserializer (SerDes)  240  for external communication, having blocks which convert data between serial data lanes and parallel interfaces in each direction. The SerDes  240  has 18 serial data lanes in this example. There may be one or more of the Ethernet controllers such as  220  to 226  which are surplus and are kept disabled because there are not enough SERDES lanes to connect to them.

In operation, inter-partition communication in one known multi-core processor of the kind shown in FIG. 1 is performed using a specific virtual I/O adapter interface and I/O operation program. The I/O operation program is under the control of supervisor/hypervisor software that initializes the message routing tables in the adapter but the supervisor/hypervisor calls impose overhead and can make communication slow. The known inter-partition communication uses a data movement protocol which is dictated by the virtual I/O adapter.

FIGS. 3,  4  and  5  illustrate part of multi-core processors  300  and  400  and a method  500  of operating a multi-core processor in accordance with embodiments of the invention, given by way of example. Each of the multi-core processors  300  and  400 includes a plurality of logical partitions  302  and  304  that have respective processor cores  306  and  308 , memory areas  310 and  312 , and Ethernet controllers  314  and  316 . The method  500  includes disabling for external communication at least one Ethernet controller  314  of the logical partitions  302  and  304  and assigning the Ethernet controller disabled for external communication as an inter-partition Ethernet controller for reception of inter-partition communication.

In an example of the method  500 , the inter-partition Ethernet controller  314  is configured in loopback mode in which it works as a DMA device in the receive mailbox of the receiving partition  302 . In an example of the method  500 , one of the logical partitions is configured as a transmitting partition  302  and another of the logical partitions as a receiving partition  304  having respective private memory areas. The transmitting partition  302  addresses a message MBUF through a send buffer  318  in its private memory area  312  to the inter-partition Ethernet controller  314  assigned to the receiving partition  302 , under the management of the queue manager  209  for example. The receiving partition  302  copies the message MSG received in its inter-partition Ethernet controller  314  into a receive buffer  320  in the private memory area  310  of the receiving partition.

In this example of the method  500 , and as illustrated in FIG. 3, the inter-partition Ethernet controller  314  of the receiving partition  302  is configured in loopback mode and after copying the received message into said receive buffer returns the received message MSG to the application  324  in the sending partition  304 , under the management of the queue manager 209 , notifying the sending partition of reception of the message, and the sending partition then resumes control of the memory space of the send buffer and can re-use the send buffer  318  for other messages.

Inter-partition communication in multi-core processor

In another example of the method  500 , and as illustrated in FIG. 4, the inter-partition Ethernet controller  314  of the receiving partition  302  is configured in loopback mode and frees the memory space of the send buffer  318  to the private memory  312 of the sending partition after copying the received message MSG into the receive buffer  320 , by instructing the buffer manager  208 .

Inter-partition communication in multi-core processor

In this example of the method  500 , the receiving partition  302  copying the message MSG received in its inter-partition Ethernet controller  314  into a receive buffer  320  includes the receiving partition  302  allocating the receive buffer  302  from its memory area  310  and its inter-partition Ethernet controller  314  copying the message MSG received into the allocated receive buffer  302 , the inter-partition Ethernet controller  314  and the receive buffer  302  forming a receive mailbox for the inter-partition message.

In more detail, in this example of the multi-core processors  300  and  400 , application codes  322  and  324  are running in the partitions  302  and  304  under the LWE OS and the multi-core processor  300  is enabled for reconfigurable data path acceleration (DPA) operation. Ethernet controllers that are disabled for external communication, for example if the current device configuration leaves insufficient SerDes lanes for those Ethernet controllers, or because of device errata, are identified. A respective one of the Ethernet controllers identified as suitable for the inter-partition communication is then assigned to each of the logical partitions  302  and  304  in loopback mode. The buffer manager  208  can then allocate receive buffers such as  318  and  320  for the logical partitions  302  and  304  from their private memory  310  and  312 , forming mailboxes for messages.

When the sending partition  304  defines a message to be sent to the receiving partition  302 , the sending partition  304 instructs the buffer manager  208  to allocate space in its private memory  312  to the send buffer  318 , as shown by the arrows  326 . The message to be sent MBUF is registered in the send buffer  318  and queued through the queue manager 209  to the inter-partition mailbox receive Ethernet controller  314 , as shown by the arrows  328 . The receive Ethernet controller  314  in the receiving partition  302  then instructs the buffer manager  208  to allocate space in its private memory 310  for the receive buffer  320 , as shown by the arrows  330  and copies the received message MSG into the receive buffer 320  as message MBUF, as shown by the arrow  332 . In the processor  300 , the receive Ethernet controller  314  then returns the received message MSG to the application  324  in the sending partition  304  under the management of the queue manager  209 , notifying the sending partition of reception of the message, and the sending partition  304  then resumes control of the memory space of the send buffer  318 . Alternatively, in the processor  400 , the receive Ethernet controller  314 then instructs the buffer manager  208  to free the send buffer  318  to the private memory  312  of the sending partition  304 , as shown by the arrow  402 . In each case, the receive Ethernet controller  314  then notifies the application  322  of the received message MBUF, as shown by the arrow  336 .

The method  500  of operating a multi-core processor is summarized in the simplified flow chart of FIG. 5. The method starts at  502 . At  504 , the partitions and OSs of the multi-core processor are configured. At  506 , Ethernet controllers  314  and  316 which are disabled for external communication are identified and receive Ethernet controllers  314  are configured in loopback mode at  508 . At  510 , the sending partition  304  instructs the buffer manager  208  to allocate space in its private memory  312 for the send buffer  318 . At  512 , the application  324  of the sending partition queues the message MBUF through the queue manager  209  to the receive Ethernet controller  314 . The receive Ethernet controller  314  instructs the buffer manager  208  to allocate space in its private memory  310  for the receive buffer  320  at  514 . At  516 , the receive Ethernet controller  314 copies the received message MSG into the receive buffer  320 . At  518 , either the receive Ethernet controller  314  returns the received message MSG through the queue manager  209  to the sending partition  304  as loopback and the sending partition 304  then resumes control of the memory space of the send buffer  318  or the receive Ethernet controller  314  instructs the buffer  manager  208  to free the send buffer  318  to the private memory  312  of the sending partition  304 . At  520 , the receive Ethernet controller  314  notifies the application  322  of the received message MBUF and the method ends at  522 .

Inter-partition communication in multi-core processor

It will be appreciated that no cycles of core processing time are necessary to copy the message for the receiving partition 302 . Also the inter-partition communication does not involve cycles of hypervisor time, and the allocation of buffers as mailboxes with the receive Ethernet controller is performed by the receiving partition  302  instructing the buffer manager  208 . The Ethernet controllers  314  and  316  used for inter-partition communication are available in the processor  300  and are not specific hardware for inter-partition communication. The operation of copying the received message MSG into the receivebuffer  320  can be performed by any suitable data movement protocol. More than one buffer can be allocated to receive mailboxes by a receiving partition  302 , if desired. Access control to sender partition memory can be enforced using Input Output Memory Management Unit ('IOMMU') enabling the receiver partition to allow its inter-partition-mailbox Ethernet port to copy messages selectively from a sending partition.

The invention may also be implemented using at least portions of processor code for performing steps of a method according to the invention when run on a programmable apparatus, such as a computer system or enabling a programmable apparatus to perform functions of a device or system according to the invention. A computer program is a list of instructions such as a particular application program and/or an operating system in processor code. The computer program may for instance include one or more of: a subroutine, a function, a procedure, an object method, an object implementation, an executable application, an applet, a servlet, a source code, an object code, a shared library/dynamic load library and/or other sequence of instructions designed for execution on a computer system.

The computer program may be stored internally on computer readable storage medium or transmitted to the computer system via a computer readable transmission medium. All or some of the computer program may be provided on computer readable media permanently, removably or remotely coupled to an information processing system. The computer readable media may include, for example and without limitation, any number of the following: magnetic storage media including disk and tape storage media; optical storage media such as compact disk media (e.g., CD-ROM, CD-R, etc.) and digital video disk storage media; nonvolatile memory storage media including semiconductor-based memory units such as FLASH memory, EEPROM, EPROM, ROM; ferromagnetic digital memories; MRAM; volatile storage media including registers, buffers or caches, main memory, RAM, etc.; and data transmission media including computer networks, point-to-point telecommunication equipment, and carrier wave transmission media, just to name a few.

In the foregoing specification, the invention has been described with reference to specific examples of embodiments of the invention. It will, however, be evident that various modifications and changes may be made therein without departing from the broader spirit and scope of the invention as set forth in the appended claims.

The connections as discussed herein may be any type of connection suitable to transfer signals from or to the respective nodes, units or devices, for example via intermediate devices. Accordingly, unless implied or stated otherwise, the connections may be direct connections or indirect connections. The connections may be illustrated or described in reference to being a single connection, a plurality of connections, unidirectional connections, or bidirectional connections. However, different embodiments may vary the implementation of the connections. For example, separate unidirectional connections may be used rather than bidirectional connections and vice versa. Also, a plurality of connections may be replaced with a single connection that transfers multiple signals serially or in a time multiplexed manner. Likewise, single connections carrying multiple signals may be separated out into various different connections carrying subsets of these signals. Therefore, many options exist for transferring signals.

SRC= https://www.google.com.hk/patents/US20130227243

Inter-partition communication in multi-core processor


更多文章、技术交流、商务合作、联系博主

微信扫码或搜索:z360901061

微信扫一扫加我为好友

QQ号联系: 360901061

您的支持是博主写作最大的动力,如果您喜欢我的文章,感觉我的文章对您有帮助,请用微信扫描下面二维码支持博主2元、5元、10元、20元等您想捐的金额吧,狠狠点击下面给点支持吧,站长非常感激您!手机微信长按不能支付解决办法:请将微信支付二维码保存到相册,切换到微信,然后点击微信右上角扫一扫功能,选择支付二维码完成支付。

【本文对您有帮助就好】

您的支持是博主写作最大的动力,如果您喜欢我的文章,感觉我的文章对您有帮助,请用微信扫描上面二维码支持博主2元、5元、10元、自定义金额等您想捐的金额吧,站长会非常 感谢您的哦!!!

发表我的评论
最新评论 总共0条评论